FIGURE 1



GGGGCTTCGGCGCCAGCGGCCAGCGCTAGTCGGTCTGGTAAGGATTTACAAAAGGTGCAGGTATG 
AGCAGGTCTGAAGACTAACATTTTGTGAAGTTGTAAAACAGAAAACCTGTTAGAAATGTGGTGGT
TTCAGCAAGGCCTCAGTTTCCTTCCTTCAGCCCTTGTAATTTGGACATCTGCTGCTTTCATATTT 
TCATACATTACTGCAGTAACACTCCACCATATAGACCCGGCTTTACCTTATATCAGTGACACTGG 
TACAGTAGCTCCAGAAAAATGCTTATTTGGGGCAATGCTAAATATTGCGGCAGTTTTATGCATTG 
CTACCATTTATGTTCGTTATAAGCAAGTTCATGCTCTGAGTCCTGAAGAGAACGTTATCATCAAA
TTAAACAAGGCTGGCCTTGTACTTGGAATACTGAGTTGTTTAGGACTTTCTATTGTGGCAAACTT
CCAGAAAACAACCCTTTTTGCTGCACATGTAAGTGGAGCTGTGCTTACCTTTGGTATGGGCTCAT
TATATATGTTTGTTCAGACCATCCTTTCCTACCAAATGCAGCCCAAAATCCATGGCAAACAAGTC
TTCTGGATCAGACTGTTGTTGGTTATCTGGTGTGGAGTAAGTGCACTTAGCATGCTGACTTGCTC
ATCAGTTTTGCACAGTGGCAATTTTGGGACTGATTTAGAACAGAAACTCCATTGGAACCCCGAGG

ACAAAGGTTATGTGCTTCACATGATCACTACTGCAGCAGAATGGTCTATGTCATTTTCCTTCTTT 
GGTTTTTTCCTGACTTACATTCGTGATTTTCAGAAAATTTCTTTACGGGTGGAAGCCAATTTACA 
TGGATTAACCCTCTATGACACTGCACCTTGCCCTATTAACAATGAACGAACACGGCTACTTTCCA
GAGATATTTGATGAAAGGATAAAATATTTCTGTAATGATTATGATTCTCAGGGATTGGGGAAAGG
TTCACAGAAGTTGCTTATTCTTCTCTGAAATTTTCAACCACTTAATCAAGGCTGACAGTAACACT 
GATGAATGCTGATAATCAGGAAACATGAAAGAAGCCATTTGATAGATTATTCTAAAGGATATCAT



CAAGAAGACTATTAAAAACACCTATGCCTATACTTTTTTATCTCAGAAAATAAAGTCAAAAGACT



ATG 
FIGURE 2






<subunit 1 of 1, 266 aa, 1 stop
<MW: 29766, pI: 8.39, NX(S/T): 0

MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV
LCIATIYVRYKQVHALSPEENVIIKLNKAGLVLGILSCLGLSIVANFQKTTLFAAHVSGAVLTFG
MGSLYMFVQTILSYQMQPKIHGKQVFWIRLLLVIWCGVSALSMLTCSSVLHSGNFGTDLEQKLHW 
NPEDKGYVLHMITTAAEWSMSFSFFGFFLTYIRDFQKISLRVEANLHGLTLYDTAPCPINNERTR 
LLSRDI 

Important features: 

Type II transmembrane domain: 

amino acids 13-33 

Other Transmembrane domains: 

amino acids 54-73, 94-113, 160-180, 122-141 

N-myristoylation sites. 

amino acids 57-63, 95-101, 99-105, 124-130, 183-189 
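
The header above reports `NX(S/T): 0`, meaning no N-X-S/T sequons were found in the 266-residue sequence. A minimal Python sketch of such a scan follows. It assumes the common convention that X is any residue except proline; the function name `find_sequons` is hypothetical and is not taken from the source.

```python
import re

# N-X-S/T sequon with X != P (assumed convention).
# The lookahead lets overlapping candidates be reported.
SEQUON = re.compile(r"(?=N[^P][ST])")

def find_sequons(protein: str) -> list[int]:
    """Return 1-based start positions of N-X-S/T sequons."""
    seq = "".join(protein.split()).upper()  # drop stray whitespace from OCR
    return [m.start() + 1 for m in SEQUON.finditer(seq)]

# First line of the FIGURE 2 sequence, as read here.
first_line = "MWWFQQGLSFLPSALVIWTSAAFIFSYITAVTLHHIDPALPYISDTGTVAPEKCLFGAMLNIAAV"
print(find_sequons(first_line))
print(find_sequons("MNSSAANPTQNGT"))
```

Positions are 1-based to match the "amino acids m-n" convention used in the feature lists.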
FIGURE 3



CGGACGCGTGGGCGGACGCGTGGGGGAGAGCCGCAGTCCCGGCTGCAGCACCTGGGAGAAGGCAGACC 
GTGTGAGGGGGCCTGTGGCCCCAGCGTGCTGTGGCCTCGGGGAGTGGGAAGTGGAGGCAGGAGCCTTC 
CTTACACTTCGCCATGAGTTTCCTCATCGACTCCAGCATCATGATTACCTCCCAGATACTATTTTTTG 
GATTTGGGTGGCTTTTCTTCATGCGCCAATTGTTTAAAGACTATGAGATACGTCAGTATGTTGTACAG 
GTGATCTTCTCCGTGACGTTTGCATTTTCTTGCACCATGTTTGAGCTCATCATCTTTGAAATCTTAGG 
AGTATTGAATAGCAGCTCCCGTTATTTTCACTGGAAAATGAACCTGTGTGTAATTCTGCTGATCCTGG 
TTTTCATGGTGCCTTTTTACATTGGCTATTTTATTGTGAGCAATATCCGACTACTGCATAAACAACGA
CTGCTTTTTTCCTGTCTCTTATGGCTGACCTTTATGTATTTCTTCTGGAAACTAGGAGATCCCTTTCC
CATTCTCAGCCCAAAACATGGGATCTTATCCATAGAACAGCTCATCAGCCGGGTTGGTGTGATTGGAG
TGACTCTCATGGCTCTTCTTTCTGGATTTGGTGCTGTCAACTGCCCATACACTTACATGTCTTACTTC
CTCAGGAATGTGACTGACACGGATATTCTAGCCCTGGAACGGCGACTGCTGCAAACCATGGATATGAT
CATAAGCAAAAAGAAAAGGATGGCAATGGCACGGAGAACAATGTTCCAGAAGGGGGAAGTGCATAACA
AACCATCAGGTTTCTGGGGAATGATAAAAAGTGTTACCACTTCAGCATCAGGAAGTGAAAATCTTACT
CTTATTCAACAGGAAGTGGATGCTTTGGAAGAATTAAGCAGGCAGCTTTTTCTGGAAACAGCTGATCT
ATATGCTACCAAGGAGAGAATAGAATACTCCAAAACCTTCAAGGGGAAATATTTTAATTTTCTTGGTT

ACTTTTTCTCTATTTACTGTGTTTGGAAAATTTTCATGGCTACCATCAATATTGTTTTTGATCGAGTT 
GGGAAAACGGATCCTGTCACAAGAGGCATTGAGATCACTGTGAATTATCTGGGAATCCAATTTGATGT
GAAGTTTTGGTCCCAACACATTTCCTTCATTCTTGTTGGAATAATCATCGTCACATCCATCAGAGGAT
TGCTGATCACTCTTACCAAGTTCTTTTATGCCATCTCTAGCAGTAAGTCCTCCAATGTCATTGTCCTG 
CTATTAGCACAGATAATGGGCATGTACTTTGTCTCCTCTGTGCTGCTGATCCGAATGAGTATGCCTTT
AGAATACCGCACCATAATCACTGAAGTCCTTGGAGAACTGCAGTTCAACTTCTATCACCGTTGGTTTG
ATGTGATCTTCCTGGTCAGCGCTCTCTCTAGCATACTCTTCCTCTATTTGGCTCACAAACAGGCACCA 
GAGAAGCAAATGGCACCTTGAACTTAAGCCTACTACAGACTGTTAGAGGCCAGTGGTTTCAAAATTTA
GATATAAGAGGGGGGAAAAATGGAACCAGGGCCTGACATTTTATAAACAAACAAAATGCTATGGTAGC 
ATTTTTCACCTTCATAGCATACTCCTTCCCCGTCAGGTGATACTATGACCATGAGTAGCATCAGCCAG 
AACATGAGAGGGAGAACTAACTCAAGACAATACTCAGCAGAGAGCATCCCGTGTGGATATGAGGCTGG
TGTAGAGGCGGAGAGGAGCCAAGAAACTAAAGGTGAAAAATACACTGGAACTCTGGGGCAAGACATGT 
CTATGGTAGCTGAGCCAAACACGTAGGATTTCCGTTTTAAGGTTCACATGGAAAAGGTTATAGCTTTG
CCTTGAGATTGACTCATTAAAATCAGAGACTGTAACAAAAAAAAAAAAAAAAAAAAAGGGCGGCCGCG
ACTCTAGAGTCGACCTGCAGAAGCTTGGCCGCCATGGCCCAACTTGTTTATTGCAGCTTATAATG 
FIGURE 4



MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYVVQVIFSVTFAFSCTMFELIIFEILGV
LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP
FPILSPKHGILSIEQLISRVGVIGVTLMALLSGFGAVNCPYTYMSYFLRNVTDTDILALERRLLQ
TMDMIISKKKRMAMARRTMFQKGEVHNKPSGFWGMIKSVTTSASGSENLTLIQQEVDALEELSRQ
LFLETADLYATKERIEYSKTFKGKYFNFLGYFFSIYCVWKIFMATINIVFDRVGKTDPVTRGIEI
TVNYLGIQFDVKFWSQHISFILVGIIIVTSIRGLLITLTKFFYAISSSKSSNVIVLLLAQIMGMY
FVSSVLLIRMSMPLEYRTIITEVLGELQFNFYHRWFDVIFLVSALSSILFLYLAHKQAPEKQMAP

Important features:

Signal peptide:

amino acids 1-23

Potential transmembrane domains:

amino acids 37-55, 81-102, 150-168, 283-311, 338-356, 375-398, 425-444

N-glycosylation sites.

amino acids 67-70, 180-183 and 243-246

Eukaryotic cobalamin-binding proteins 

amino acids 151-160 
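
The feature lists give residue ranges that count from 1 and include both end points. The following Python sketch shows one way to pull the corresponding residues out of a sequence. The helper names `parse_ranges` and `slice_feature` are hypothetical, and the two sequence lines are the first 130 residues of the FIGURE 4 sequence as read here.

```python
import re

def parse_ranges(text: str) -> list[tuple[int, int]]:
    """Extract (start, end) pairs from text such as 'amino acids 1-23, 67-70'."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\s*-\s*(\d+)", text)]

def slice_feature(protein: str, start: int, end: int) -> str:
    """Return residues start..end, counting from 1 and including both ends."""
    seq = "".join(protein.split())  # drop stray whitespace from OCR
    return seq[start - 1:end]

line1 = "MSFLIDSSIMITSQILFFGFGWLFFMRQLFKDYEIRQYVVQVIFSVTFAFSCTMFELIIFEILGV"
line2 = "LNSSSRYFHWKMNLCVILLILVFMVPFYIGYFIVSNIRLLHKQRLLFSCLLWLTFMYFFWKLGDP"

for start, end in parse_ranges("amino acids 1-23, 67-70"):
    print(start, end, slice_feature(line1 + line2, start, end))
```

Slicing a listed range is also a quick consistency check on a transcribed sequence: the residues at 67-70 should form an N-X-S/T motif if the preceding lines have the right length.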
FIGURE 5



AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAGTTCC
AGAACTCTCCATCCGGACTAGTTATTGAGCATCTGCCTCTCATATCACCAGTGGCCATCTGAGGT 
GTTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTTGCTTCTCACT 
TCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCTTTCCATCCAGGT 
GTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAAAAGGCGAACCAGCAGCTGAATTTCACAG
AAGCTAAGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAGGACCAAGTTGAAACAGCC 
TTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGATGGATTCGTGGTCATCTCTAG 
GATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTGTCCTGATTTGGAAGGTTCCAGTGA 
GCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGATACTTGGACTAACTCGTGCATTCCAGAA 
ATTATCACCACCAAAGATCCCATATTCAACACTCAAACTGCAACACAAACAACAGAATTTATTGT
CAGTGACAGTACCTACTCGGTGGCATCCCCTTACTCTACAATACCTGCCCCTACTACTACTCCTC 
CTGCTCCAGCTTCCACTTCTATTCCACGGAGAAAAAAATTGATTTGTGTCACAGAAGTTTTTATG
GAAACTAGCACCATGTCTACAGAAACTGAACCATTTGTTGAAAATAAAGCAGCATTCAAGAATGA
AGCTGCTGGGTTTGGAGGTGTCCCCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTG 
CAGCTGGTCTTGGATTTTGCTATGTCAAAAGGTATGTGAAGGCCTTCCCTTTTACAAACARGAAT 
CAGCAGAAGGAAATGATCGAAACCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAA
TGAGGAATCAAAGAAAACTGATAAAAACCCAGAAGAGTCCAAGAGTCCAAGCAAAACTACCGTGC 
GATGCCTGGAAGCTGAAGTTTAGATGAGACAGAAATGAGGAGACACACCTGAGGCTGGTTTCTTT
CATGCTCCTTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCA 
CCCTTGGTTCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAAT
GCCCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCCT 
TTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAAGGAC 
CTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGGTGGGTTG 
AAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGACCCTTTCTTCA 
GCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTAAGAGCAAAAGAAT
GGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAGACCTAATCTCTGTAAA
GCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACTGTCAGCAGGGACTGTAAAC
ACAGACAGGGTCAAAGTGTTTTCTCTGAACACATTGAGTTGGAATCACTGTTTAGAACACACACA
CTTACTTTTTCTGGTCTCTACCACTGCTGATATTTTCTCTAGGAAATATACTTTTACAAGTAACA 
AAAATAAAAACTCTTATAAATTTCTATTTTTATCTGAGTTACAGAAATGATTACTAAGGAAGATT
ACTCAGTAATTTGTTTAAAAAGTAATAAAATTCAACAAACATTTGCTGAATAGCTACTATATGTC
AAGTGCTGTGCAAGGTATTACACTCTGTAATTGAATATTATTCCTCAAAAAATTGCACATAGTAG
AACGCTATCTGGGAAGCTATTTTTTTCAGTTTTGATATTTCTAGCTTATCTACTTCCAAACTAAT 
TTTTATTTTTGCTGAGACTAATCTTATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATT
TATTATTAACATACCTAAGAAGTACATTGTTACCTCTATATACCAAAGCACATTTTAAAAGTGCC
ATTAACAAATGTATCACTAGCCCTCCTTTTTCCAACAAGAAGGGACTGAGAGATGCAGAAATATT
TGTGACAAAAAATTAAAGCATTTAGAAAACTT



App. ID 10063602, Page 155 of 318



FIGURE 6 



MARCFSLVLLLTSIWTTRLLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTEAKEACRLLG 
LSLAGKDQVETALKASFETCSYGWVGDGFVVISRISPNPKCGKNGVGVLIWKVPVSRQFAAYCYN
SSDTWTNSCIPEIITTKDPIFNTQTATQTTEFIVSDSTYSVASPYSTIPAPTTTPPAPASTSIPR 
RKKLICVTEVFMETSTMSTETEPFVENKAAFKNEAAGFGGVPTALLVLALLFFGAAAGLGFCYVK 
RYVKAFPFTNKNQQKEMIETKVVKEEKANDSNPNEESKKTDKNPEESKSPSKTTVRCLEAEV

Signal sequence:

amino acids 1-16

Transmembrane domain:

amino acids 235-254

N-glycosylation site.

amino acids 53-57, 130-134, 289-293

Casein kinase II phosphorylation site.

amino acids 145-149, 214-218

Tyrosine kinase phosphorylation site.

amino acids 79-88 

N-myristoylation site. 

amino acids 23-29, 65-71, 234-240, 235-239, 249-255, 253-259 
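
The figures pair each nucleotide sequence with the polypeptide it encodes. As an illustration of that relationship, the sketch below translates an open reading frame with the standard genetic code. It is an assumption-laden example rather than the method used to prepare the figures: the function name `translate` is hypothetical, translation simply starts at the first ATG of the fragment supplied, and unrecognised codons (for example those containing ambiguity codes such as R) are reported as X.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (TTT, TTC, TTA, ...).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(dna: str) -> str:
    """Translate from the first ATG in the fragment up to the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop stray whitespace from OCR
    start = seq.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(seq) - 2, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # X marks an unrecognised codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# Fragment of FIGURE 5 around the start codon of the FIGURE 6 polypeptide.
print(translate("GGCACGATGGCCAGGTGCTTCAGCCTGGTG"))
```

Because the first ATG of a full cDNA is not necessarily the true start codon, the fragment passed in should begin shortly before the intended start.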
FIGURE 7



CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGCAGCCCGGC 
GGCCTCCCGGCGGGAGCGAGCAGATCCAGTCCGGCCCGCAGCGCAACTCGGTCCAGTCGGGGCGG 
CGGCTGCGGGCGCAGAGCGGAGATGCAGCGGCTTGGGGCCACCCTGCTGTGCCTGCTGCTGGCGG 
CGGCGGTCCCCACGGCCCCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCG 
GCTCTCAGCTACCCGCAGGAGGAGGCCACCCTCAATGAGATGTTCCGCGAGGTTGAGGAACTGAT 
GGAGGACACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTA
AAGCATCATCAGAAGTGAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACAGAC
ACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAACAACCAGAC
TGGACAAATGGTCTTTTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAAGGCAGAAGGAGCC
ACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCATGTACTGCCAGTTTGCCAGCTTCCAGTAC
ACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGGGACAGTGAGTGCTGTGGAGACCA 
GCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACCAGGGGCAGCAATGGGACCATCTGTGACA 
ACCAGAGGGACTGCCAGCCGGGGCTGTGCTGTGCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGC 
ACACCCCTGCCCGTGGAGGGCGAGCTTTGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCAC 
CTGGGAGCTAGAGCCTGATGGAGCCTTGGACCGATGCCCTTGTGCCAGTGGCCTCCTCTGCCAGC
CCCACAGCCACAGCCTGGTGTATGTGTGCAAGCCGACCTTCGTGGGGAGCCGTGACCAAGATGGG
GAGATCCTGCTGCCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGTGCG
CCAGGAGCTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTG
CCGCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTGCAA

TAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCTTCTTCCTA 
CATCTTCTTCCCAGTAAGTTTCCCCTCTGGCTTGACAGCATGAGGTGTTGTGCATTTGTTCAGCT
CCCCCAGGCTGTTCTCCAGGCTTCACAGTCTGGTGCTTGGGAGAGTCAGGCAGGGTTAAACTGCA
GGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGCCTCTACCAGTTGGCAGACAGCCG
TTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAGGAGATGGAAACAATGTGGAGTCTCCCTC
TGATTGGTTTTGGGGAAATGTGGAGAAGAGTGCCCTGCTTTGCAAACATCAACCTGGCAAAAATG 
CAACAAATGAATTTTCCACGCAGTTCTTTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGC 
AGATGAAATGTTCTGTTCACCCTGCATTACATGTGTTTATTCATCCAGCAGTGTTGCTCAGCTCC 
TACCTCTGTGCCAGGGCAGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGG 
AGGGGGTCATTGTTCTCCTCGTCCATCAGGGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCC 
CAAGTCACACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCT 
CTCCACTACCCCACACCAGCCTTGGTGCCACCAAAAGTGCTCCCCAAAAGGAAGGAGAATGGGAT 
TTTTCTTGAGGCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACATCCCTCTAAAAGTAAA
CTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAATGAAGACAATGAT
ATTGACACTGTCCCTCTTTGGCAGTTGCATTAGTAACTTTGAAAGGTATATGACTGAGCGTAGCA
TACAGGTTAACCTGCAGAAACAGTACTTAGGTAATTGTAGGGCGAGGATTATAAATGAAATTTGC
AAAATCACTTAGCAGCAACTGAAGACAATTATCAACCACGTGGAGAAAATCAAACCGAGCAGGGC
TGTGTGAAACATGGTTGTAATATGCGACTGCGAACACTGAACTCTACGCCACTCCACAAATGATG 
TTTTCAGGTGTCATGGACTGTTGCCACCATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTGCA 
CATGATTGTATAAGCATGCTTTCTTTGAGTTTTAAATTATGTATAAACATAAGTTGCATTTAGAA
AT C AAGCAT AAAT C AC T T CAAC T G C AAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 8



MQRLGATLLCLLLAAAVPTAPAPAPTATSAPVKPGPALSYPQEEATLNEMFREVEELMEDTQHKL
RSAVEEMEAEEAAAKASSEVNLANLPPSYHNETNTDTKVGNNTIHVHREIHKITNNQTGQMVFSE
TVITSVGDEEGRRSHECIIDEDCGPSMYCQFASFQYTCQPCRGQRMLCTRDSECCGDQLCVWGHC
TKMATRGSNGTICDNQRDCQPGLCCAFQRGLLFPVCTPLPVEGELCHDPASRLLDLITWELEPDG
ALDRCPCASGLLCQPHSHSLVYVCKPTFVGSRDQDGEILLPREVPDEYEVGSFMEEVRQELEDLE
RSLTEEMALGEPAAAAAALLGGEEI

Signal sequence:

amino acids 1-19

Casein kinase II phosphorylation site.

amino acids 46-50, 67-71, 98-102, 135-139, 206-210, 312-316, 327-331

N-glycosylation site.

amino acids 96-100, 106-110, 121-125, 204-208

N-myristoylation site.

amino acids 202-208, 217-223

Amidation site.

amino acids 140-144
FIGURE 9



CGGACGCGTGGGCGGACGCGTGGGGGCTGTGAGAAAGTGCCAATAAATACATCATGCAACCCCAC
GGCCCACCTTGTGAACTCCTCGTGCCCAGGGCTGATGTGCGTCTTCCAGGGCTACTCATCCAAAG 
GCCTAATCCAACGTTCTGTCTTCAATCTGCAAATCTATGGGGTCCTGGGGCTCTTCTGGACCCTT 
AACTGGGTACTGGCCCTGGGCCAATGCGTCCTCGCTGGAGCCTTTGCCTCCTTCTACTGGGCCTT 
CCACAAGCCCCAGGACATCCCTACCTTCCCCTTAATCTCTGCCTTCATCCGCACACTCCGTTACC 
ACACTGGGTCATTGGCATTTGGAGCCCTCATCCTGACCCTTGTGCAGATAGCCCGGGTCATCTTG 
GAGTATATTGACCACAAGCTCAGAGGAGTGCAGAACCCTGTAGCCCGCTGCATCATGTGCTGTTT
CAAGTGCTGCCTCTGGTGTCTGGAAAAATTTATCAAGTTCCTAAACCGCAATGCATACATCATGA
TCGCCATCTACGGGAAGAATTTCTGTGTCTCAGCCAAAAATGCGTTCATGCTACTCATGCGAAAC
ATTGTCAGGGTGGTCGTCCTGGACAAAGTCACAGACCTGCTGCTGTTCTTTGGGAAGCTGCTGGT
GGTCGGAGGCGTGGGGGTCCTGTCCTTCTTTTTTTTCTCCGGTCGCATCCCGGGGCTGGGTAAAG
ACTTTAAGAGCCCCCACCTCAACTATTACTGGCTGCCCATCATGACCTCCATCCTGGGGGCCTAT
GTCATCGCCAGCGGCTTCTTCAGCGTTTTCGGCATGTGTGTGGACACGCTCTTCCTCTGCTTCCT
GGAAGACCTGGAGCGGAACAACGGCTCCCTGGACCGGCCCTACTACATGTCCAAGAGCCTTCTAA
AGATTCTGGGCAAGAAGAACGAGGCGCCCCCGGACAACAAGAAGAGGAAGAAGTGACAGCTCCGG
CCCTGATCCAGGACTGCACCCCACCCCCACCGTCCAGCCATCCAACCTCACTTCGCCTTACAGGT

CTCCATTTTGTGGTAAAAAAAGGTTTTAGGCCAGGCGCCGTGGCTCACGCCTGTAATCCAACACT 
TTGAGAGGCTGAGGCGGGCGGATCACCTGAGTCAGGAGTTCGAGACCAGCCTGGCCAACATGGTG 
AAACCTCCGTCTCTATTAAAAATACAAAAATTAGCCGAGAGTGGTGGCATGCACCTGTCATCCCA 
GCTACTCGGGAGGCTGAGGCAGGAGAATCGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAGCCGA 
GATCGCGCCACTGCACTCCAACCTGGGTGACAGACTCTGTCTCCAAAACAAAACAAACAAACAAA 
AAGATTTTATTAAAGATATTTTGTTAACTC
FIGURE 10



RTRGRTRGGCEKVPINTSCNPTAHLVNSSCPGLMCVFQGYSSKGLIQRSVFNLQIYGVLGLFWTL 
NWVLALGQCVLAGAFASFYWAFHKPQDIPTFPLISAFIRTLRYHTGSLAFGALILTLVQIARVIL
EYIDHKLRGVQNPVARCIMCCFKCCLWCLEKFIKFLNRNAYIMIAIYGKNFCVSAKNAFMLLMRN
IVRVVVLDKVTDLLLFFGKLLVVGGVGVLSFFFFSGRIPGLGKDFKSPHLNYYWLPIMTSILGAY
VIASGFFSVFGMCVDTLFLCFLEDLERNNGSLDRPYYMSKSLLKILGKKNEAPPDNKKRKK 



Important features: 

Transmembrane domains:

amino acids 57-80 (type II), 110-126, 215-231, 254-274



N-glycosylation sites. 

amino acids 16-20, 27-31, 289-293 

Hypothetical YBR002c family proteins.

amino acids 276-288 

Ammonium transporters proteins. 

amino acids 204-231 

N-myristoylation sites. 

amino acids 60-66, 78-84 

Amidation site. 

amino acids 306-310 






FIGURE 11 



GCCCCGCGCCCGGCGCCGGGCGCCCGAAGCCGGGAGCCACCGCCATGGGGGCCTGCCTGGGAGCCTGC 
TCCCTGCTCAGCTGCGCGTCCTGCCTCTGCGGCTCTGCCCCCTGCATCCTGTGCAGCTGCTGCCCCGC 
CAGCCGCAACTCCACCGTGAGCCGCCTCATCTTCACGTTCTTCCTCTTCCTGGGGGTGCTGGTGTCCA 
TCATTATGCTGAGCCCGGGCGTGGAGAGTGAGCTCTACAAGCTGCCCTGGGTGTGTGAGGAGGGGGCC 
GGGATCCCCACCGTCCTGCAGGGCCACATCGACTGTGGCTCCCTGCTTGGCTACCGCGCTGTCTACCG 
CATGTGCTTCGCCACGGCGGCCTTCTTCTTCTTCTTTTTCACCCTGCTCATGCTCTGCGTGAGCAGCA 
GCCGGGACCCCCGGGCTGCCATCCAGAATGGGTTTTGGTTCTTTAAGTTCCTGATCCTGGTGGGCCTC 
ACCGTGGGTGCCTTCTACATCCCTGACGGCTCCTTCACCAACATCTGGTTCTACTTCGGCGTCGTGGG 
CTCCTTCCTCTTCATCCTCATCCAGCTGGTGCTGCTCATCGACTTTGCGCACTCCTGGAACCAGCGGT
GGCTGGGCAAGGCCGAGGAGTGCGATTCCCGTGCCTGGTACGCAGGCCTCTTCTTCTTCACTCTCCTC
TTCTACTTGCTGTCGATCGCGGCCGTGGCGCTGATGTTCATGTACTACACTGAGCCCAGCGGCTGCCA
CGAGGGCAAGGTCTTCATCAGCCTCAACCTCACCTTCTGTGTCTGCGTGTCCATCGCTGCTGTCCTGC



CCAAGGTCCAGGACGCCCAGCCCAACTCGGGTCTGCTGCAGGCCTCGGTCATCACCCTCTACACCATG 
TTTGTCACCTGGTCAGCCCTATCCAGTATCCCTGAACAGAAATGCAACCCCCATTTGCCAACCCAGCT 
GGGCAACGAGACAGTTGTGGCAGGCCCCGAGGGCTATGAGACCCAGTGGTGGGATGCCCCGAGCATTG 
TGGGCCTCATCATCTTCCTCCTGTGCACCCTCTTCATCAGTCTGCGCTCCTCAGACCACCGGCAGGTG 
AACAGCCTGATGCAGACCGAGGAGTGCCCACCTATGCTAGACGCCACACAGCAGCAGCAGCAGCAGGT
GGCAGCCTGTGAGGGCCGGGCCTTTGACAACGAGCAGGACGGCGTCACCTACAGCTACTCCTTCTTCC
ACTTCTGCCTGGTGCTGGCCTCACTGCACGTCATGATGACGCTCACCAACTGGTACAAGCCCGGTGAG
ACCCGGAAGATGATCAGCACGTGGACCGCCGTGTGGGTGAAGATCTGTGCCAGCTGGGCAGGGCTGCT 
CCTCTACCTGTGGACCCTGGTAGCCCCACTCCTCCTGCGCAACCGCGACTTCAGCTGAGGCAGCCTCA 
CAGCCTGCCATCTGGTGCCTCCTGCCACCTGGTGCCTCTCGGCTCGGTGACAGCCAACCTGCCCCCTC 
CCCACACCAATCAGCCAGGCTGAGCCCCCACCCCTGCCCCAGCTCCAGGACCTGCCCCTGAGCCGGGC 
CTTCTAGTCGTAGTGCCTTCAGGGTCCGAGGAGCATCAGGCTCCTGCAGAGCCCCATCCCCCCGCCAC 
ACCCACACGGTGGAGCTGCCTCTTCCTTCCCCTCCTCCCTGTTGCCCATACTCAGCATCTCGGATGAA 
AGGGCTCCCTTGTCCTCAGGCTCCACGGGAGCGGGGCTGCTGGAGAGAGCGGGGAACTCCCACCACAG 
TGGGGCATCCGGCACTGAAGCCCTGGTGTTCCTGGTCACGTCCCCCAGGGGACCCTGCCCCCTTCCTG 
GACTTCGTGCCTTACTGAGTCTCTAAGACTTTTTCTAATAAACAAGCCAGTGCGTGTAAAAAAAA
FIGURE 12

MGACLGACSLLSCASCLCGSAPCILCSCCPASRNSTVSRLIFTFFLFLGVLVSIIMLSPGVESQL 
YKLPWVCSEGAGIPTVLQGHIDCGSLLGYRAVYRMCFATAAFFFFFFTLLMLCVSSSRDPRAAIQ
NGFWFFKFLILVGLTVGAFYIPDGSFTNIWFYFGVVGSFLFILIQLVLLIDFAHSWNQRWLGKAE
ECDSRAWYAGLFFFTLLFYLLSIAAVALMFMYYTEPSGCHEGKVFISLNLTFCVCVSIAAVLPKV 
QDAQPNSGLLQASVITLYTMFVTWSALSSIPEQKCNPHLPTQLGNETVVAGPEGYETQWWDAPSI
VGLIIFLLCTLFISLRSSDHRQVNSLMQTEECPPMLDATQQQQQQVAACEGRAFDNEQDGVTYSY

SFFHFCLVLASLHVMMTLTNWYKPGETRKMISTWTAVWVKICASWAGLLLYLWTLVAPLLLRNRD
FS

Signal sequence:

amino acids 1-20

Transmembrane domains:

amino acids 40-58, 101-116, 134-150, 162-178, 206-223, 240-257, 272-283, 324-340, 391-406, 428-444
FIGURE 13

CGGGCCAGCCTGGGGCGGCCGGCCAGGAACCACCCGTTAAGGTGTCTTCTCTTTAGGGATGGTGA 
GGTTGGAAAAAGACTCCTGTAACCCTCCTCCAGGATGAACCACCTGCCAGAAGACATGGAGAACG 
CTCTCACCGGGAGCCAGAGCTCCCATGCTTCTCTGCGCAATATCCATTCCATCAACCCCACACAA 
CTCATGGCCAGGATTGAGTCCTATGAAGGAAGGGAAAAGAAAGGCATATCTGATGTCAGGAGGAC
TTTCTGTTTGTTTGTCACCTTTGACCTCTTATTCGTAACATTACTGTGGATAATAGAGTTAAATG 
TGAATGGAGGCATTGAGAACACATTAGAGAAGGAGGTGATGCAGTATGACTACTATTCTTCATAT
TTTGATATATTTCTTCTGGCAGTTTTTCGATTTAAAGTGTTAATACTTGCATATGCTGTGTGCAG 
ACTGCGCCATTGGTGGGCAATAGCGTTGACAACGGCAGTGACCAGTGCCTTTTTACTAGCAAAAG
TGATCCTTTCGAAGCTTTTCTCTCAAGGGGCTTTTGGCTATGTGCTGCCCATCATTTCATTCATC 
CTTGCCTGGATTGAGACGTGGTTCCTGGATTTCAAAGTGTTACCTCAAGAAGCAGAAGAAGAAAA
CAGACTCCTGATAGTTCAGGATGCTTCAGAGAGGGCAGCACTTATACCTGGTGGTCTTTCTGATG 
GTCAGTTTTATTCCCCTCCTGAATCCGAAGCAGGATCTGAAGAAGCTGAAGAAAAACAGGACAGT
GAGAAACCACTTTTAGAACTATGAGTACTACTTTTGTTAAATGTGAAAAACCCTCACAGAAAGTC
ATCGAGGCAAAAAGAGGCAGGCAGTGGAGTCTCCCTGTCGACAGTAAAGTTGAAATGGTGACGTC
CACTGCTGGCTTTATTGAACAGCTAATAAAGATTTATTTATTGTAATACCTCACAAACGTTGTAC
CATATCCATGCACATTTAGTTGCCTGCCTGTGGCTGGTAAGGTAATGTCATGATTCATCCTCTCT 
TCAGTGAGACTGAGCCTGATGTGTTAACAAATAGGTGAAGAAAGTCTTGTGCTGTATTCCTAATC
AAAAGACTTAATATATTGAAGTAACACTTTTTTAGTAAGCAAGATACCTTTTTATTTCAATTCAC
AGAATGGAATTTTTTTGTTTCATGTCTCAGATTTATTTTGTATTTCTTTTTTAACACTCTACATT 
TCCCTTGTTTTTTAACTCATGCACATGTGCTCTTTGTACAGTTTTAAAAAGTGTAATAAAATCTG 
ACATGTCAATGTGGCTAGTTTTATTTTTCTTGTTTTGCATTATGTGTATGGCCTGAAGTGTTGGA 
CTTGCAAAAGGGGAAGAAAGGAATTGCGAATACATGTAAAATGTCACCAGACATTTGTATTATTT
TTATCATGAAATCATGTTTTTCTCTGATTGTTCTGAAATGTTCTAAATACTCTTATTTTGAATGC 
ACAAAATGACTTAAACCATTCATATCATGTTTCCTTTGCGTTCAGCCAATTTCAATTAAAATGAA 
CTAAATTAAAAA 
FIGURE 14

MNHLPEDMENALTGSQSSHASLRNIHSINPTQLMARIESYEGREKKGISDVRRTFCLFVTFDLLF 
VTLLWIIELNVNGGIENTLEKEVMQYDYYSSYFDIFLLAVFRFKVLILAYAVCRLRHWWAIALTT
AVTSAFLLAKVILSKLFSQGAFGYVLPIISFILAWIETWFLDFKVLPQEAEEENRLLIVQDASER
AALIPGGLSDGQFYSPPESEAGSEEAEEKQDSEKPLLEL 

Important features of the protein: 
Signal peptide:

amino acids 1-20

Transmembrane domains:

amino acids 54-72, 100-118, 130-144, 146-166

N-myristoylation sites.

amino acids 14-20, 78-84, 79-85, 202-208, 217-223
FIGURE 15



ACTCGAACGCAGTTGCTTCGGGACCCAGGACCCCCTCGGGCCCGACCCGCCAGGAAAGACTGAGG 
CCGCGGCCTGCCCCGCCCGGCTCCCTGCGCCGCCGCCGCCTCCCGGGACAGRAGATGTGCTCCAG 
GGTCCCTCTGCTGCTGCCGCTGCTCCTGCTACTGGCCCTGGGGCCTGGGGTGCAGGGCTGCCCAT 
CCGGCTGCCAGTGCAGCCAGCCACAGACAGTCTTCTGCACTGCCCGCCAGGGGACCACGGTGCCC 
CGAGACGTGCCACCCGACACGGTGGGGCTGTACGTCTTTGAGAACGGCATCACCATGCTCGACGC
AGGCAGCTTTGCCGGCCTGCCGGGCCTGCAGCTCCTGGACCTGTCACAGAACCAGATCGCCAGCC 
TGCCCAGCGGGGTCTTCCAGCCACTCGCCAACCTCAGCAACCTGGACCTGACGGCCAACAGGCTG 
CATGAAATCACCAATGAGACCTTCCGTGGCCTGCGGCGCCTCGAGCGCCTCTACCTGGGCAAGAA 
CCGCATCCGCCACATCCAGCCTGGTGCCTTCGACACGCTCGACCGCCTCCTGGAGCTCAAGCTGC 
AGGACAACGAGCTGCGGGCACTGCCCCCGCTGCGCCTGCCCCGCCTGCTGCTGCTGGACCTCAGC
CACAACAGCCTCCTGGCCCTGGAGCCCGGCATCCTGGACACTGCCAACGTGGAGGCGCTGCGGCT
GGCTGGTCTGGGGCTGCAGCAGCTGGACGAGGGGCTCTTCAGCCGCTTGCGCAACCTCCACGACC
TGGATGTGTCCGACAACCAGCTGGAGCGAGTGCCACCTGTGATCCGAGGCCTCCGGGGCCTGACG
CGCCTGCGGCTGGCCGGCAACACCCGCATTGCCCAGCTGCGGCCCGAGGACCTGGCCGGCCTGGC
TGCCCTGCAGGAGCTGGATGTGAGCAACCTAAGCCTGCAGGCCCTGCCTGGCGACCTCTCGGGCC
TCTTCCCCCGCCTGCGGCTGCTGGCAGCTGCCCGCAACCCCTTCAACTGCGTGTGCCCCCTGAGC
TGGTTTGGCCCCTGGGTGCGCGAGAGCCACGTCACACTGGCCAGCCCTGAGGAGACGCGCTGCCA

CTTCCCGCCCAAGAACGCTGGCCGGCTGCTCCTGGAGCTTGACTACGCCGACTTTGGCTGCCCAG 
CCACCACCACCACAGCCACAGTGCCCACCACGAGGCCCGTGGTGCGGGAGCCCACAGCCTTGTCT
TCTAGCTTGGCTCCTACCTGGCTTAGCCCCACAGCGCCGGCCACTGAGGCCCCCAGCCCGCCCTC
CACTGCCCCACCGACTGTAGGGCCTGTCCCCCAGCCCCAGGACTGCCCACCGTCCACCTGCCTCA

ATGGGGGCACATGCCACCTGGGGACACGGCACCACCTGGCGTGCTTGTGCCCCGAAGGCTTCACG 
GGCCTGTACTGTGAGAGCCAGATGGGGCAGGGGACACGGCCCAGCCCTACACCAGTCACGCCGAG
GCCACCACGGTCCCTGACCCTGGGCATCGAGCCGGTGAGCCCCACCTCCCTGCGCGTGGGGCTGC
AGCGCTACCTCCAGGGGAGCTCCGTGCAGCTCAGGAGCCTCCGTCTCACCTATCGCAACCTATCG
GGCCCTGATAAGCGGCTGGTGACGCTGCGACTGCCTGCCTCGCTCGCTGAGTACACGGTCACCCA
GCTGCGGCCCAACGCCACTTACTCCGTCTGTGTCATGCCTTTGGGGCCCGGGCGGGTGCCGGAGG
GCGAGGAGGCCTGCGGGGAGGCCCATACACCCCCAGCCGTCCACTCCAACCACGCCCCAGTCACC

CAGGCCCGCGAGGGCAACCTGCCGCTCCTCATTGCGCCCGCCCTGGCCGCGGTGCTCCTGGCCGC 
GCTGGCTGCGGTGGGGGCAGCCTACTGTGTGCGGCGGGGGCGGGCCATGGCAGCAGCGGCTCAGG 
ACAAAGGGCAGGTGGGGCCAGGGGCTGGGCCCCTGGAACTGGAGGGAGTGAAGGTCCCCTTGGAG 
CCAGGCCCGAAGGCAACAGAGGGCGGTGGAGAGGCCCTGCCCAGCGGGTCTGAGTGTGAGGTGCC 
ACTCATGGGCTTCCCAGGGCCTGGCCTCCAGTCACCCCTCCACGCAAAGCCCTACATCTAAGCCA 
GAGAGAGACAGGGCAGCTGGGGCCGGGCTCTCAGCCAGTGAGATGGCCAGCCCCCTCCTGCTGCC 
ACACCACGTAAGTTCTCAGTCCCAACCTCGGGGATGTGTGCAGACAGGGCTGTGTGACCACAGCT 
GGGCCCTGTTCCCTCTGGACCTCGGTCTCCTCATCTGTGAGATGCTGTGGCCCAGCTGACGAGCC 
CTAACGTCCCCAGAACCGAGTGCCTATGAGGACAGTGTCCGCCCTGCCCTCCGCAACGTGCAGTC 
CCTGGGCACGGCGGGCCCTGCCATGTGCTGGTAACGCATGCCTGGGTCCTGCTGGGCTCTCCCAC 
TCCAGGCGGACCCTGGGGGCCAGTGAAGGAAGCTCCCGGAAAGAGCAGAGGGAGAGCGGGTAGGC 
GGCTGTGTGACTCTAGTCTTGGCCCCAGGAAGCGAAGGAACAAAAGAAACTGGAAAGGAAGATGC 
TTTAGGAACATGTTTTGCTTTTTTAAAATATATATATTTATAAGAGATCCTTTCCCATTTATTCT 
GGGAAGATGTTTTTCAAACTCAGAGACAAGGACTTTGGTTTTTGTAAGACAAACGATGATATGAA
GGCCTTTTGTAAGAAAAAATAAAAGATGAAGTGTGAAA 
FIGURE 16



MCSRVPLLLPLLLLLALGPGVQGCPSGCQCSQPQTVFCTARQGTTVPRDVPPDTVGLYVFENGIT 
MLDAGSFAGLPGLQLLDLSQNQIASLPSGVFQPLANLSNLDLTANRLHEITNETFRGLRRLERLY 
LGKNRIRHIQPGAFDTLDRLLELKLQDNELRALPPLRLPRLLLLDLSHNSLLALEPGILDTANVE
ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQLERVPPVIRGLRGLTRLRLAGNTRIAQLRPEDL
AGLAALQELDVSNLSLQALPGDLSGLFPRLRLLAAARNPFNCVCPLSWFGPWVRESHVTLASPEE
TRCHFPPKNAGRLLLELDYADFGCPATTTTATVPTTRPVVREPTALSSSLAPTWLSPTAPATEAP
SPPSTAPPTVGPVPQPQDCPPSTCLNGGTCHLGTRHHLACLCPEGFTGLYCESQMGQGTRPSPTP 
VTPRPPRSLTLGIEPVSPTSLRVGLQRYLQGSSVQLRSLRLTYRNLSGPDKRLVTLRLPASLAEY 
TVTQLRPNATYSVCVMPLGPGRVPEGEEACGEAHTPPAVHSNHAPVTQAREGNLPLLIAPALAAV 
LLAALAAVGAAYCVRRGRAMAAAAQDKGQVGPGAGPLELEGVKVPLEPGPKATEGGGEALPSGSE 
CEVPLMGFPGPGLQSPLHAKPYI

Important features: 
Signal peptide: 

amino acids 1-23 
Transmembrane domain: 

amino acids 579-599 

EGF-like domain cysteine pattern signature, 
amino acids 430-442 
Leucine zipper pattern, 
amino acids 197-219, 269-291 
N-glycosylation sites. 

amino acids 101-105, 117-121, 273-277, 500-504, 528-532 
Tyrosine kinase phosphorylation sites, 
amino acids 124-131, 337-345 
N-myristoylation sites. 

amino acids 23-29, 27-33, 70-76, 142-148, 187-193, 348-354, 
594-600, 640-646 
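
The "Leucine zipper pattern" entries above can be illustrated with a short pattern scan. The sketch below assumes the PROSITE-style definition L-x(6)-L-x(6)-L-x(6)-L (four leucines at seven-residue spacing); the source does not state which definition or tool produced the listed ranges, and the function name `find_leucine_zippers` is hypothetical.

```python
import re

# L-x(6)-L-x(6)-L-x(6)-L (assumed pattern); the lookahead reports overlapping hits.
ZIPPER = re.compile(r"(?=(L.{6}L.{6}L.{6}L))")

def find_leucine_zippers(protein: str) -> list[tuple[int, int]]:
    """Return 1-based (start, end) spans of leucine zipper pattern matches."""
    seq = "".join(protein.split()).upper()  # drop stray whitespace from OCR
    return [(m.start() + 1, m.start() + len(m.group(1)))
            for m in ZIPPER.finditer(seq)]

# Residues 196-227 of the FIGURE 16 sequence, as read here.
fragment = "ALRLAGLGLQQLDEGLFSRLRNLHDLDVSDNQ"
print(find_leucine_zippers(fragment))
```

Spans are reported relative to the fragment supplied, so an offset of 195 must be added to compare a hit in this fragment with the positions listed in the figure.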
FIGURE 17



GCAGCGGCGAGGCGGCGGTGGTGGCTGAGTCCGTGGTGGCAGAGGCGAAGGCGACAGCTCATGCG
GGTCCGGATAGGGCTGACGCTGCTGCTGTGTGCGGTGCTGCTGAGCTTGGCCTCGGCGTCCTCGG 
ATGAAGAAGGCAGCCAGGATGAATCCTTAGATTCCAAGACTACTTTGACATCAGATGAGTCAGTA
AAGGACCATACTACTGCAGGCAGAGTAGTTGCTGGTCAAATATTTCTTGATTCAGAAGAATCTGA 
ATTAGAATCCTCTATTCAAGAAGAGGAAGACAGCCTCAAGAGCCAAGAGGGGGAAAGTGTCACAG 
AAGATATCAGCTTTCTAGAGTCTCCAAATCCAGAAAACAAGGACTATGAAGAGCCAAAGAAAGTA
CGGAAACCAGCTTTGACCGCCATTGAAGGCACAGCACATGGGGAGCCCTGCCACTTCCCTTTTCT
TTTCCTAGATAAGGAGTATGATGAATGTACATCAGATGGGAGGGAAGATGGCAGACTGTGGTGTG
CTACAACCTATGACTACAAAGCAGATGAAAAGTGGGGCTTTTGTGAAACTGAAGAAGAGGCTGCT
AAGAGACGGCAGATGCAGGAAGCAGAAATGATGTATCAAACTGGAATGAAAATCCTTAATGGAAG
CAATAAGAAAAGCCAAAAAAGAGAAGCATATCGGTATCTCCAAAAGGCAGCAAGCATGAACCATA
CCAAAGCCCTGGAGAGAGTGTCATATGCTCTTTTATTTGGTGATTACTTGCCACAGAATATCCAG

GCAGCGAGAGAGATGTTTGAGAAGCTGACTGAGGAAGGCTCTCCCAAGGGACAGACTGCTCTTGG
CTTTCTGTATGCCTCTGGACTTGGTGTTAATTCAAGTCAGGCAAAGGCTCTTGTATATTATACAT 
TTGGAGCTCTTGGGGGCAATCTAATAGCCCACATGGTTTTGGTAAGTAGACTTTAGTGGAAGGCT 



AATAATATTAACATCAGAAGAATTTGTGGTTTATAGCGGCCACAACTTTTTCAGCTTTCATGATC
CAGATTTGCTTGTATTAAGACCAAATATTCAGTTGAACTTCCTTCAAATTCTTGTTAATGGATAT
AACACATGGAATCTACATGTAAATGAAAGTTGGTGGAGTCCACAATTTTTCTTTAAAATGATTAG
TTTGGCTGATTGCCCCTAAAAAGAGAGATCTGATAAATGGCTCTTTTTAAATTTTCTCTGAGTTG
GAATTGTCAGAATCATTTTTTACATTAGATTATCATAATTTTAAAAATTTTTCTTTAGTTTTTCA
AAATTTTGTAAATGGTGGCTATAGAAAAACAACATGAAATATTATACAATATTTTGCAACAATGC
CCTAAGAATTGTTAAAATTCATGGAGTTATTTGTGCAGAATGACTCCAGAGAGCTCTACTTTCTG 
TTTTTTACTTTTCATGATTGGCTGTCTTCCCATTTATTCTGGTCATTTATTGCTAGTGACACTGT 
GCCTGCTTCCAGTAGTCTCATTTTCCCTATTTTGCTAATTTGTTACTTTTTCTTTGCTAATTTGG 
AAGATTAACTCATTTTTAATAAAATTATGTCTAAGATTAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 18



MRVRIGLTLLLCAVLLSLASASSDEEGSQDESLDSKTTLTSDESVKDHTTAGRVVAGQIFLDSEESEL
ESSIQEEEDSLKSQEGESVTEDISFLESPNPENKDYEEPKKVRKPALTAIEGTAHGEPCHFPFLFLDK
EYDECTSDGREDGRLWCATTYDYKADEKWGFCETEEEAAKRRQMQEAEMMYQTGMKILNGSNKKSQKR
EAYRYLQKAASMNHTKALERVSYALLFGDYLPQNIQAAREMFEKLTEEGSPKGQTALGFLYASGLGVN
SSQAKALVYYTFGALGGNLIAHMVLVSRL

Important features: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 195-199, 217-221, 272-276 

Tyrosine kinase phosphorylation site. 

amino acids 220-228 

N-myristoylation sites. 

amino acids 120-126, 253-259, 268-274, 270-274, 285-291, 289-295 

Glycosaminoglycan attachment site. 

amino acids 267-271 

Microbodies C-terminal targeting signal. 

amino acids 299-303 

Type II fibronectin collagen-binding domain protein. 

amino acids 127-169 

Fructose-bisphosphate aldolase class-II protein. 

amino acids 101-119 
FIGURE 19



AATTCAGATTTTAAGCCCATTCTGCAGTGGAATTTCATGAACTAGCAAGAGGACACCATCTTCTT 
GTATTATACAAGAAAGGAGTGTACCTATCACACACAGGGGGAAAAATGCTCTTTTGGGTGCTAGG 
CCTCCTAATCCTCTGTGGTTTTCTGTGGACTCGTAAAGGAAAACTAAAGATTGAAGACATCACTG
ATAAGTACATTTTTATCACTGGATGTGACTCGGGCTTTGGAAACTTGGCAGCCAGAACTTTTGAT
AAAAAGGGATTTCATGTAATCGCTGCCTGTCTGACTGAATCAGGATCAACAGCTTTAAAGGCAGA 
AACCTCAGAGAGACTTCGTACTGTGCTTCTGGATGTGACCGACCCAGAGAATGTCAAGAGGACTG 
CCCAGTGGGTGAAGAACCAAGTTGGGGAGAAAGGTCTCTGGGGTCTGATCAATAATGCTGGTGTT 
CCCGGCGTGCTGGCTCCCACTGACTGGCTGACACTAGAGGACTACAGAGAACCTATTGAAGTGAA 
CCTGTTTGGACTCATCAGTGTGACACTAAATATGCTTCCTTTGGTCAAGAAAGCTCAAGGGAGAG 
TTATTAATGTCTCCAGTGTTGGAGGTCGCCTTGCAATCGTTGGAGGGGGCTATACTCCATCCAAA 
TATGCAGTGGAAGGTTTCAATGACAGCTTAAGACGGGACATGAAAGCTTTTGGTGTGCACGTCTC
ATGCATTGAACCAGGATTGTTCAAAACAAACTTGGCAGATCCAGTAAAGGTAATTGAAAAAAAAC
TCGCCATTTGGGAGCAGCTGTCTCCAGACATCAAACAACAATATGGAGAAGGTTACATTGAAAAA
AGTCTAGAGAAACTGAAAGGCAATAAATCCTATGTGAACATGGACCTCTCTCCGGTGGTAGAGTG
CATGGACCACGCTCTAACAAGTCTCTTCCCTAAGACTCATTATGCCGCTGGAAAAGATGCCAAAA

TTTTCTGGATACCTCTGTCTCACATGCCAGCAGCTTTGCAAGACTTTTTATTGTTGAAACAGAAA 
GCAGAGCTGGCTAATCCCAAGGCAGTGTGACTCAGCTAACCACAAATGTCTCCTCCAGGCTATGA
AATTGGCCGATTTCAAGAACACATCTCCTTTTCAACCCCATTCCTTATCTGCTCCAACCTGGACT
CATTTAGATCGTGCTTATTTGGATTGCAAAAGGGAGTCCCACCATCGCTGGTGGTATCCCAGGGT

CCCTGCTCAAGTTTTCTTTGAAAAGGAGGGCTGGAATGGTACATCACATAGGCAAGTCCTGCCCT 
GTATTTAGGCTTTGCCTGCTTGGTGTGATGTAAGGGAAATTGAAAGACTTGCCCATTCAAAATGA 
TCTTTACCGTGGCCTGCCCCATGCTTATGGTCCCCAGCATTTACAGTAACTTGTGAATGTTAAGT 
ATCATCTCTTATCTAAATATTAAAAGATAAGTCAACCCAAAAAAAAAAAAAAAAAAAAAAAAAAA
AAAAAAAAAAAAA 
FIGURE 20



MLFWVLGLLILCGFLWTRKGKLKIEDITDKYIFITGCDSGFGNLAARTFDKKGFHVIAACLTESG 
STALKAETSERLRTVLLDVTDPENVKRTAQWVKNQVGEKGLWGLINNAGVPGVLAPTDWLTLEDY
REPIEVNLFGLISVTLNMLPLVKKAQGRVINVSSVGGRLAIVGGGYTPSKYAVEGFNDSLRRDMK
AFGVHVSCIEPGLFKTNLADPVKVIEKKLAIWEQLSPDIKQQYGEGYIEKSLDKLKGNKSYVNMD
LSPVVECMDHALTSLFPKTHYAAGKDAKIFWIPLSHMPAALQDFLLLKQKAELANPKAV

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

Transmembrane domain: 

amino acids 136-152 

N-glycosylation sites. 

amino acids 161-163, 187-190 and 253-256 

Glycosaminoglycan attachment site. 

amino acids 39-42 

N-myristoylation sites. 

amino acids 36-41, 42-47, 103-113, 166-171, 198-203 and 207-212 
FIGURE 21



CTGAGGCGGCGGTAGCATGGAGGGGGAGAGTACGTCGGCGGTGCTCTCGGGCTTTGTGCTCGGCG 
CACTCGCTTTCCAGCACCTCAACACGGACTCGGACACGGAAGGTTTTCTTCTTGGGGAAGTAAAA 
GGTGAAGCCAAGAACAGCATTACTGATTCCCAAATGGATGATGTTGAAGTTGTTTATACAATTGA
CATTCAGAAATATATTCCATGCTATCAGCTTTTTAGCTTTTATAATTCTTCAGGCGAAGTAAATG 
AGCAAGCACTGAAGAAAATATTATCAAATGTCAAAAAGAATGTGGTAGGTTGGTACAAATTCCGT 
CGTCATTCAGATCAGATCATGACGTTTAGAGAGAGGCTGCTTCACAAAAACTTGCAGGAGCATTT 
TTCAAACCAAGACCTTGTTTTTCTGCTATTAACACCAAGTATAATAACAGAAAGCTGCTCTACTC
ATCGACTGGAACATTCCTTATATAAACCTCAAAAAGGACTTTTTCACAGGGTACCTTTAGTGGTT 
GCCAATCTGGGCATGTCTGAACAACTGGGTTATAAAACTGTATCAGGTTCCTGTATGTCCACTGG 
TTTTAGCCGAGCAGTACAAACACACAGCTCTAAATTTTTTGAAGAAGATGGATCCTTAAAGGAGG
TACATAAGATAAATGAAATGTATGCTTCATTACAAGAGGAATTAAAGAGTATATGCAAAAAAGTG
GAAGACAGTGAACAAGCAGTAGATAAACTAGTAAAGGATGTAAACAGATTAAAAGGAGAAATTGA
GAAAAGGAGAGGAGCACAGATTCAGGCAGCAAGAGAGAAGAACATCCAAAAAGACCCTCAGGAGA



ACATTTTTCTTTGTCAGGCATTACGGACCTTTTTTCCAAATTCTGAATTTCTTCATTCATGTGTT 
ATGTCTTTAAAAAATAGACATGTTTCTAAAAGTAGCTGTAACTACAACCACCATCTCGATGTAGT 
AGACAATCTGACCTTAATGGTAGAACACACTGACATTCCTGAAGCTAGTCCAGCTAGTACACCAC
AAATCATTAAGCATAAAGCCTTAGACTTAGATGACAGATGGCAATTCAAGAGATCTCGGTTGTTA
GATACACAAGACAAACGATCTAAAGCAAATACTGGTAGTAGTAACCAAGATAAAGCATCCAAAAT
GAGCAGCCCAGAAACAGATGAAGAAATTGAAAAGATGAAGGGTTTTGGTGAATATTCACGGTCTG
CTACATTTTGATCCTTTTAACCTTACAAGGAGATTTTTTTATTTGGCTGATGGGTAAAGCCAAAC
ATTTCTATTGTTTTTACTATGTTGAGCTACTTGCAGTAAGTTCATTTGTTTTTACTATGTTCACC 
TGTTTGCAGTAATACACAGATAACTCTTAGTGCATTTACTTCACAAAGTACTTTTTCAAACATCA
GATGCTTTTATTTCCAAACCTTTTTTTCACCTTTCACTAAGTTGTTGAGGGGAAGGCTTACACAG 
ACACATTCTTTAGAATTGGAAAAGTGAGACCAGGCACAGTGGCTCACACCTGTAATCCCAGCACT
TAGGGAAGACAAGTCAGGAGGATTGATTGAAGCTAGGAGTTAGAGACCAGCCTGGGCAACGTATT 
GAGACCATGTCTATTAAAAAATAAAATGGAAAAGCAAGAATAGCCTTATTTTCAAAATATGGAAA
GAAATTTATATGAAAATTTATCTGAGTCATTAAAATTCTCCTTAAGTGATACTTTTTTAGAAGTA
CATTATGGCTAGAGTTGCCAGATAAAATGCTGGATATCATGCAATAAATTTGCAAAACATCATCT
AAAAT TTAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 22



MEGESTSAVLSGFVLGALAFQHLNTDSDTEGFLLGEVKGEAKNSITDSQMDDVEWYTIDIQKYI 
PCYQLFSFYNSSGEVNEQALKKILSNVKKNVVGWYKFRRHSDQIMTFRERLLHKNLQEHFSNQDL
VFLLLTPSIITESCSTHRLEHSLYKPQKGLFHRVPLVVANLGMSEQLGYKTVSGSCMSTGFSRAV
QTHSSKFFEEDGSLKEVHKINEMYASLQEELKSICKKVEDSEQAVDKLVKDVNRLKREIEKRRGA
QIQAAREKNIQKDPQENIFLCQALRTFFPNSEFLHSCVMSLKNRHVSKSSCNYNHHLDVVDNLTL
MVEHTDIPEASPASTPQIIKHKALDLDDRWQFKRSRLLDTQDKRSKANTGSSNQDKASKMSSPET 
DEEIEKMKGFGEYSRSPTF 

Important features: 
Signal peptide: 

amino acids 1-19 

N-glycosylation sites. 

amino acids 75-79, 322-326 

N-myristoylation site. 

amino acids 184-190

Growth factor and cytokines receptors family. 

amino acids 134-150 
FIGURE 23



GGCACAGCCGCGCGGCGGAGGGCAGAGTCAGCCGAGCCGAGTCCAGCCGGACGAGCGGACCAGCGCAGGGCAGCCCAA 
GCAGCGCGCAGCGAACGCCCGCCGCCGCCCACACCGTCTGCGGTCCCCGCGGCGCCTGCCACCCTTCCCTCCTTCCCC 
GCGTCCCCGCCTCGCCGGCCAGTCAGCTTGCCGGGTTCGCTGCCCCGCGAAACCCCGAGGTCACCAGCCCGCGCCTCT 
GCTTCCCTGGGCCGCGCGCCGCCTCCACGCCCTCCTTCTCCCCTGGCCCGGCGCCTGGCACCGGGGACCGTTGCCTGA 
CGCGAGGCCCAGCTCTACTTTTCGCCCCGCGTCTCCTCCGCCTGCTCGCCTCTTCCACCAACTCCAACTCCTTCTCCC 
TCCAGCTCCACTCGCTAGTCCCCGACTCCGCCAGCCCTCGGCCCGCTGCCGTAGCGCCGCTTCCCGTCCGGTCCCAAA 
GGTGGGAACGCGTCCGCCCCGGCCCGCACCATGGCACGGTTCGGCTTGCCCGCGCTTCTCTGCACCCTGGCAGTGCTC 
AGCGCCGCGCTGCTGGCTGCCGAGCTCAAGTCGAAAAGTTGCTCGGAAGTGCGACGTCTTTACGTGTCCAAAGGCTTC 
AACAAGAACGATGCCCCCCTCCACGAGATCAACGGTGATCATTTGAAGATCTGTCCCCAGGGTTCTACCTGCTGCTCT 
CAAGAGAT GGAGGAGAAGT ACAGCCT GCAAAGTAAAGATGATT T CAAAAGT GT GGT CAGC GAACAGTGCAAT CATTTG 
C AAGCTGT CTT T GCT T CACGTT ACAAGAAGTTT GAT GAAT T C TT CAAAGAACTACTT GAAAAT GCAGAGAAAT CCCT G 
AATGATAT GT T T GTGAAGACAT ATGGCCATT TAT ACATGCAAAAT T CT GAGCTAT TT AAAGATCT CTTCGT AGAGTTG 
AAACGTTACTACGTGGTGGGAAATGT GAACCTGGAAGAAATGCTAAATGACTTCT GGGCTCGCCT CCT GGAGCGGATG 
TTCCGCCTGGTGAACTCCCAGTACCACTTTACAGATGAGTATCTGGAATGTGTGAGCAAGTATACGGAGCAGCTGAAG 
CCCTTCGGAGATGTCCCTCGCAAATTGAAGCTCCAGGTTACTCGTGCTTTTGTAGCAGCCCGTACTTTCGCTCAAGGC 
TTAGC G GT T GCGG GAGAT GT CGT GAG CAAGG TCT C C GTGGT AAACCCC ACAGC C CAGT GTACCCAT GC C CT GTTGAAG 
AT GAT C TACTGCT CCCACT GCCGGGGTCT C GTGACT GT GAAGC CAT GTTACAACTACT GCT CAAACAT CAT GAGAGGC 
T GT TT G GCCAAC CAAGGG GAT CT CGATTTT GAAT GGAACAATTT CATAGAT GCT AT GCTGAT G GT GGCAGAGAGG CTA 
GAGGGT C CTT TC AACAT T GAATC GGT CAT GGAT C C CATC GATGT GAAGAT TTCT GAT GCTATTAT GAAC AT GCAGGAT 
AATAGTGT T CAAGTGT CT CAGAAGGT TTT C CAG GGAT GTGGAC C CC C CAAGCC C CT C C CAGCT GGACGAAT TT CT C GT 
TCCATCTCTGAAAGTGCCTTCAGTGCTCGCTTCAGACCACATCACCCCGAGGAACGCCCAACCACAGCAGCTGGCACT 
AGT TT GGACCGACTGGTTACT GAT GT CAAGGAGAAACT GAAAC AGGCCAAGAAAT TCTGGTCCTCCCTTCC GAGC AAC 
GTTTGCAACGATGAGAGGATGGCTGCAGGAAACGGCAATGAGGATGACTGTTGGAATGGGAAAGGCAAAAGCAGGTAC 
CT GTT T GCAGT GACAGGAAAT GGATT AGC CAAC CAGGGCAACAAC C CAGAG GT CCAGGT T GACAC CAGCAAACCAGAC 
AT ACT GAT C CTT CGT CAAAT CAT GGCTCTT CGAGT GAT GAC CAGCAAGAT GAAGAAT G CAT ACAAT GGGAACGACGTG 
GACTTCTTTGATATCAGTGATGAAAGTAGTGGAGAAGGAAGTGGAAGTGGCTGTGAGTATCAGCAGTGCCCTTCAGAG 
TTTGACTACAATGCCACTGACCATGCTGGGAAGAGTGCCAATGAGAAAGCCGACAGTGCTGGTGTCCGTCCTGGGGCA
CAGGCCTACCTCCTCACTGTCTTCTGCATCTTGTTCCTGGTTATGCAGAGAGAGTGGAGATAATTCTCAAACTCTGAG 
AAAAAGTGTTCATCAAAAAGTTAAAAGGCACCAGTTATCACTTTTCTACCATCCTAGTGACTT 

TGGACAACAATGTACAGTTTTTACTATGTGGCCACTGGTTTAAGAAGTGCTGACTTTGTTTTCTCATTCAGTTTTGGG 
AGGAAAAGGGACTGTGCATTGAGTTGGTTCCTGCTCCCCCAAACCATGTTAAACGTGGCTAACAGTGTAGGTACAGAA 
CTATAGTTAGTTGTGCATTTGTGATTTTATCACTCTATTATTTGTTTGTATGTTTTTTTCTCATTTCGTTTGTGGGTT 
TTTTTTTCCAACTGTGATCTCGCCTTGTTTCTTACAAGCAAACCAGGGTCCCTTCTTGGCACGT AACAT GTACGTATT 
TCTGAAATATTAAAT AGCT GTACAGAAGCAGGTTTTATTTATCATGTTAT CTTATTAAAAGAAAAAGCCCAAAAAGC 
FIGURE 24

MARFGLPALLCTLAVLSAALLAAELKSKSCSEVRRLYVSKGFNKNDAPLHEINGDHLKICPQGST 
CCSQEMEEKYSLQSKDDFKSVVSEQCNHLQAVFASRYKKFDEFFKELLENAEKSLNDMFVKTYGH
LYMQNSELFKDLFVELKRYYVVGNVNLEEMLNDFWARLLERMFRLVNSQYHFTDEYLECVSKYTE
QLKPFGDVPRKLKLQVTRAFVAARTFAQGLAVAGDVVSKVSVVNPTAQCTHALLKMIYCSHCRGL
VTVKPCYNYCSNIMRGCLANQGDLDFEWNNFIDAMLMVAERLEGPFNIESVMDPIDVKISDAIMN 
MQDNSVQVSQKVFQGCGPPKPLPAGRISRSISESAFSARFRPHHPEERPTTAAGTSLDRLVTDVK 
EKLKQAKKFWSSLPSNVCNDERMAAGNGNEDDCWNGKGKSRYLFAVTGNGLANQGNNPEVQVDTS
KPDILILRQIMALRVMTSKMKNAYNGNDVDFFDISDESSGEGSGSGCEYQQCPSEFDYNATDHAG
KSANEKADSAGVRPGAQAYLLTVFCILFLVMQREWR

Important features: 
Signal peptide: 

amino acids 1-22 

ATP/GTP-binding site motif A (P-loop).

amino acids 515-524 

N-glycosylation site. 

amino acids 514-518 

Glycosaminoglycan attachment sites. 

amino acids 494-498, 498-502 

N-myristoylation sites. 

amino acids 63-69, 224-230, 276-282, 438-444, 497-503, 531-537

Glypicans proteins.

amino acids 54-75, 105-157, 238-280, 309-346, 423-460, 468-506 
FIGURE 25



CTCGCCCTCAAATGGGAACGCTGGCCTGGGACTAAAGCATAGACCACCAGGCTGAGTATCCTGAC 
CTGAGTCATCCCCAGGGATCAGGAGCCTCCAGCAGGGAACCTTCCATTATATTCTTCAAGCAACT 
TACAGCTGCACCGACAGTTGCGATGAAAGTTCTAATCTCTTCCCTCCTCCTGTTGCTGCCACTAA 
TGCTGATGTCCATGGTCTCTAGCAGCCTGAATCCAGGGGTCGCCAGAGGCCACAGGGACCGAGGC 
CAGGCTTCTAGGAGATGGCTCCAGGAAGGCGGCCAAGAATGTGAGTGCAAAGATTGGTTCCTGAG 
AGCCCCGAGAAGAAAATTCATGACAGTGTCTGGGCTGCCAAAGAAGCAGTGCCCCTGTGATCATT 
TCAAGGGCAATGTGAAGAAAACAAGACACCAAAGGCACCACAGAAAGCCAAACAAGCATTCCAGA 
GCCTGCCAGCAATTTCTCAAACAATGTCAGCTAAGAAGCTTTGCTCTGCCTTTG TAGG AGCTCTG 
AGCGCCCACTCTTC CAAT TAAACAT T C TCAGCCAAGAAGACAG T GAGCACACCTACCAGACACTC 
TTCTTCTCCCACCTCACTCTCCCACTGTACCCACCCCTAAATCATTCCAGTGCTCTCAAAAAGCA 
TGTTTTTCAAGATCATTTTGTTTGTTGCTCTCTCTAGTGTCTTCTTCTCTCGTCAGTCTTAGCCT 
GTGCCCTCCCCTTACCCAGGCTTAGGCTTAATTACCTGAAAGATTCCAGGAAACTGTAGCTTCCT 
AGCTAGTGTCATTTAACC T TAAATGCAATCAGGAAAGTAGCAAACAGAAGTCAATAAATATT TTT 
AAATGTCAAAAAAAAAAAAAAAAAA 
FIGURE 26



MKVLISSLLLLLPLMLMSMVSSSLNPGVARGHRDRGQASRRWLQEGGQECECKDWFLRAPRRKFM
TVSGLPKKQCPCDHFKGNVKKTRHQRHHRKPNKHSRACQQFLKQCQLRSFALPL 

Important features: 
Signal peptide: 

amino acids 1-22 

N-myristoylation sites. 

amino acids 27-33, 46-52 
FIGURE 27



GGACGCCAGCGCCTGCAGAGGCTGAGCAGGGAAAAAGCCAGTGCCCCAGCGGAAGCACAGCTCAG 
AGCTGGTCTGCC ATG GACATCCTGGTCCCACTCCTGCAGCTGCTGGTGCTGCTTCTTACCCTGCC 
CCTGCACCTCATGGCTCTGCTGGGCTGCTGGCAGCCCCTGTGCAAAAGCTACTTCCCCTACCTGA 
TGGCCGTGCTGACTCCCAAGAGCAACCGCAAGATGGAGAGCAAGAAACGGGAGCTCTTCAGCCAG 
ATAAAGGGGCTTACAGGAGCCTCCGGGAAAGTGGCCCTACTGGAGCTGGGCTGCGGAACCGGAGC 
CAACTTTCAGTTCTACCCACCGGGCTGCAGGGTCACCTGCCTAGACCCAAATCCCCACTTTGAGA 
AGTTCCTGACAAAGAGCATGGCTGAGAACAGGCACCTCCAATATGAGCGGTTTGTGGTGGCTCCT 
GGAGAGGACATGAGACAGCTGGCTGATGGCTCCATGGATGTGGTGGTCTGCACTCTGGTGCTGTG
CTCTGTGCAGAGCCCAAGGAAGGTCCTGCAGGAGGTCCGGAGAGTACTGAGACCGGGAGGTGTGC
TCTTTTTCTGGGAGCATGTGGCAGAACCATATGGAAGCTGGGCCTTCATGTGGCAGCAAGTTTTC
GAGCCCACCTGGAAACACATTGGGGATGGCTGCTGCCTCACCAGAGAGACCTGGAAGGATCTTGA
GAACGCCCAGTTCTCCGAAATCCAAATGGAACGACAGCCCCCTCCCTTGAAGTGGCTACCTGTTG
GGCCCCACATCATGGGAAAGGCTGTCAAACAATCTTTCCCAAGCTCCAAGGCACTCATTTGCTCC
TTCCCCAGCCTCCAATTAGAACAAGCCACCCACCAGCCTATCTATCTTCCACTGAGAGGGACCTA
GCAGAATGAGAGAAGACATTCATGTACCACCTACTAGTCCCTCTCTCCCCAACCTCTGCCAGGGC
AATCTCTAACTTCAATCCCGCCTTCGACAGTGAAAAAGCTCTACTTCTACGCTGACCCAGGGAGG
AAACACTAGGACCCTGTTGTATCCTCAACTGCAAGTTTCTGGACTAGTCTCCCAACGTTTGCCTC
CCAATGTTGTCCCTTTCCTTCGTTCCCATGGTAAAGCTCCTCTCGCTTTCCTCCTGAGGCTACAC
CCATGCGTCTCTAGGAACTGGTCACAAAAGTCATGGTGCCTGCATCCCTGCCAAGCCCCCCTGAC
CCTCTCTCCCCACTACCACCTTCTTCCTGAGCTGGGGGCACCAGGGAGAATCAGAGATGCTGGGG 
ATGCCAGAGCAAGACT CAAAGAGGCAGAGG T T T T GT T CT CAAATAT T T T T T AATAAATAGACGAA 
ACCACG 
FIGURE 28



MDILVPLLQLLVLLLTLPLHLMALLGCWQPLCKSYFPYLMAVLTPKSNRKMESKKRELFSQIKGL
TGASGKVALLELGCGTGANFQFYPPGCRVTCLDPNPHFEKFLTKSMAENRHLQYERFVVAPGEDM
RQLADGSMDVVVCTLVLCSVQSPRKVLQEVRRVLRPGGVLFFWEHVAEPYGSWAFMWQQVFEPTW
KHIGDGCCLTRETWKDLENAQFSEIQMERQPPPLKWLPVGPHIMGKAVKQSFPSSKALICSFPSL
QLEQATHQPIYLPLRGT

Important features:
Signal peptide:

amino acids 1-23

Leucine zipper pattern.

amino acids 10-32

N-myristoylation sites.

amino acids 64-70, 78-84, 80-86, 91-97, 201-207
FIGURE 29



C^ATGTTTGCCTATCCACCTCCCCCAAGCCCCTTTACCTATGCTGCTGCTAACGCTGCTGCTGCT 
GCTGCTGCTGCTTAAAGGCTCATGCTTGGAGTGGGGACTGGTCGGTGCCCAGAAAGTCTCTTCTG 
CCACTGACGCCCCCATCAGGGATTGGGCCTTCTTTCCCCCTTCCTTTCTGTGTCTCCTGCCTCAT 
CGGCCTGCCATGACCTGCAGCCAAGCCCAGCCCCGTGGGGAAGGGGAGAAAGTGGGGGATGGCTA 
AGAAAGCTGGGAGATAGGGAACAGAAGAGGGTAGTGGGTGGGCTAGGGGGGCTGCCTTATTTAAA 
GTGGTTGT TTATGAT T CTTATACTAAT T TATACAAAGATAT TAAGGCCCTGTTCATTAAGAAAT T 
GTTCCCTTCCCCTGTGTTCAATGTTTGTAAAGATTGTTCTGTGTAAATATGTCTTTATAATAAAC 
AGTTAAAAGCTGAAAAAAAAAAAAAAAAAAAAAAAAAAA
FIGURE 30

MLLLTLLLLLLLLKGSCLEWGLVGAQKVSSATDAPIRDWAFFPPSFLCLLPHRPAMTCSQAQPRG 
EGEKVGDG 



Important features: 
Signal peptide: 

amino acids 1-15 

Growth factor and cytokines receptors family: 

amino acids 3-18
FIGURE 31






GTTTGAATTCCTTCAACTATACCCACAGTCCAAAAGCAGACTCACTGTGTCCCAGGCTACCAGTT
CCTCCAAGCAAGTCATTTCCCTTATTTAACCGATGTGTCCCTCAAACACCTGAGTGCTACTCCCT 
ATTTGCATCTGTTTTGATAAATGATGTTGACACCCTCCACCGAATTCTAAGTGGAATCATGTCGG 
GAAGAGATACAATCCTTGGCCTGTGTATCCTCGCATTAGCCTTGTCTTTGGCCATGATGTTTACC 
TTCAGATTCATCACCACCCTTCTGGTTCACATTTTCATTTCATTGGTTATTTTGGGATTGTTGTT 
TGTCTGCGGTGTTTTATGGTGGCTGTATTATGACTATACCAACGACCTCAGCATAGAATTGGACA 
CAGAAAGGGAAAATATGAAGTGCGTGCTGGGGTTTGCTATCGTATCCACAGGCATCACGGCAGTG 
CTGCTCGTCTTGATTTTTGTTCTCAGAAAGAGAATAAAATTGACAGTTGAGCTTTTCCAAATCAC 
AAATAAAGCCATCAGCAGTGCTCCCTTCCTGCTGTTCCAGCCACTGTGGACATTTGCCATCCTCA 
TTTTCTTCTGGGTCCTCTGGGTGGCTGTGCTGCTGAGCCTGGGAACTGCAGGAGCTGCCCAGGTT 
ATGGAAGGCGGCCAAGTGGAATATAAGCCCCTTTCGGGCATTCGGTACATGTGGTCGTACCATTT 
AATTGGCCTCATCTGGACTAGTGAATTCATCCTTGCGTGCCAGCAAATGACTATAGCTGGGGCAG
TGGTTACTTGTTATTTCAACAGAAGTAAAAATGATCCTCCTGATCATCCCATCCTTTCGTCTCTC
TCCATTCTCTTCTTCTACCATCAAGGAACCGTTGTGAAAGGGTCATTTTTAATCTCTGTGGTGAG
GATTCCGAGAATCATTGTCATGTACATGCAAAACGCACTGAAAGAACAGCAGCATGGTGCATTGT
CCAGGTACCTGTTCCGATGCTGCTACTGCTGTTTCTGGTGTCTTGACAAATACCTGCTCCATCTC
AACCAGAATGCATATACTACAACTGCTATTAATGGGACAGATTTCTGTACATCAGCAAAAGATGC
ATTCAAAATCTTGTCCAAGAACTCAAGTCACTTTACATCTATTAACTGCTTTGGAGACTTCATAA
TTTTTCTAGGAAAGGTGTTAGTGGTGTGTTTCACTGTTTTTGGAGGACTCATGGCTTTTAACTAC 
AATCGGGCATTCCAGGTGTGGGCAGTCCCTCTGTTATTGGTAGCTTTTTTTGCCTACTTAGTAGC 
CCATAGTTTTTTATCTGTGTTTGAAACTGTGCTGGATGCACTTTTCCTGTGTTTTGCTGTTGATC 
T GGAAACAAATGAT GGAT CGTCAGAAAAGCCCTACT T TAT GGATCAAGAAT T TC TGAGTT TCGTA 
AAAAGGAGCAACAAATTAAACAATGCAAGGGCACAGCAGGACAAGCACTCATTAAGGAATGAGGA 
GGGAACAGAACTCCAGGCCATTGTGAGATAGATACCCATTTAGGTATCTGTACCTGGAAAACATT
T CCT TC TAAGAGCCAT T T ACAGAAT AGAAGAT GAGACC ACT AGAG AAAAG T TAG T GAAT T T T T T T 
TTAAAAGACCTAATAAACCCTATTCTTCCTCAAAA 
FIGURE 32



MSGRDTILGLCIIALALSLAMMFTFRFITTLLVHIFISLVILGLLFVCGVLWWLYYDYTNDLSIE
LDTERENMKCVLGFAIVSTGITAVLLVLIFVLRKRTKLTVELFQITNKAISSAPFLLFQPLWTFA
ILIFFWVLWVAVLLSLGTAGAAQVMEGGQVEYKPLSGIRYMWSYHLIGLIWTSEFILACQQMTIA
GAVVTCYFNRSKNDPPDHPILSSLSILFFYHQGTVVKGSFLISVVRIPRIIVMYMQNALKEQQHG
ALSRYLFRCCYCCFWCLDKYLLHLNQNAYTTTAINGTDFCTSAKDAFKILSKNSSHFTSINCFGD
FIIFLGKVLVVCFTVFGGLMAFNYNRAFQVWAVPLLLVAFFAYLVAHSFLSVFETVLDALFLCFA
VDLETNDGSSEKPYFMDQEFLSFVKRSNKLNNARAQQDKHSLRNEEGTELQAIVR

Important features:

Signal peptide:

amino acids 1-20

Putative transmembrane domains:

amino acids 35-54, 75-97, 126-146, 185-204, 333-350, 352-371

N-glycosylation sites.

amino acids 204-208, 295-299, 313-317

N-myristoylation sites.

amino acids 147-153, 178-184, 196-202, 296-302, 342-348
FIGURE 33



GTTCGATTAGCTCCTCTGAGAAGAAGAGAAAAGGTTCTTGGACCTCTCCCTGTTTCTTCCTTAGA 
ATAATT T G TATGGGAT T TGTGAT GCAGGAAAGCC TAAGG GAAAAAGAAT AT T CAT TC T GT GT GGT 
GAAAATTTTTTGAAAAAAAAATTGCCTTCTTCAAACAAGGGTGTCATTCTGATATTTATGAGGAC 
TGTTGTTCTCACTATGAAGGCATCTGTTATTGAAATGTTCCTTGTTTTGCTGGTGACTGGAGTAC 
AT TCAAAC AAAGAAAC GGC AAAGAAGAT TAAAAGGC CC AAG T TCACT GT GC C T C AGATCAAC T GC 
GATGTCAAAGCCGGAAAGATCATCGATCCTGAGTTCATTGTGAAATGTCCAGCAGGATGCCAAGA 
CCCCAAATACCATGTTTATGGCACTGACGTGTATGCATCCTACTCCAGTGTGTGTGGCGCTGCCG 
TACACAGTGGTGTGCTTGATAATTCAGGAGGGAAAATACTTGTTCGGAAGGTTGCTGGACAGTCT 
GGTTACAAAGGGAGTTATTCCAACGGTGTCCAATCGTTATCCCTACCACGATGGAGAGAATCCTT 
TATCGTCTTAGAAAGTAAACCCAAAAAGGGTGTAACCTACCCATCAGCTCTTACATACTCATCAT
CGAAAAGTCCAGCTGCCCAAGCAGGTGAGACCACAAAAGCCTATCAGAGGCCACCTATTC
ACAACTGCACAGCCGGTCACTCTGATGCAGCTTCTGGCTGTCACTGTAGCTGTGGCCACCCCCAC
CACCTTGCCAAGGCCATCCCCTTCTGCTGCTTCTACCACCAGCATCCCCAGACCACAATCAGTGG
GCCACAGGAGCCAGGAGATGGATCTCTGGTCCACTGCCACCTACACAAGCAGCCAAAACAGGCCC
AGAGCTGATCCAGGTATCCAAAGGCAAGATCCTTCAGGAGCTGCCTTCCAGAAACCTGTTGGAGC
GGATGTCAGCCTGGGACTTGTTCCAAAAGAAGAATTGAGCACACAGTCTTTGGAGCCAGTATCCC
TGGGAGATCCAAACTGCAAAATTGACTTGTCGTTTTTAATTGATGGGAGCACCAGCATTGGCAAA
CGGCGATTCCGAATCCAGAAGCAGCTCCTGGCTGATGTTGCCCAAGCTCTTGACATTGGCCCTGC
CGGTCCACTGATGGGTGTTGTCCAGTATGGAGACAACCCTGCTACTCACTTTAACCTCAAGACAC
ACACGAATTCTCGAGATCTGAAGACAGCCATAGAGAAAATTACTCAGAGAGGAGGACTTTCTAAT
GTAGGTCGGGCCATCTCCTTTGTGACCAAGAACTTCTTTTCCAAAGCCAATGGAAACAGAAGCGG 
GGCTCCCAATGTGGTGGTGGTGATGGTGGATGGCTGGCCCACGGACAAAGTGGAGGAGGCTTCAA 
GACTTGCGAGAGAGTCAGGAATCAACATTTTCTTCATCACCATTGAAGGTGCTGCTGAAAATGAG 
AAGCAGTATGTGGTGGAGCCCAACTTTGCAAACAAGGCCGTGTGCAGAACAAACGGCTTCTACTC 
GCTCCACGTGCAGAGCTGGTTTGGCCTCCACAAGACCCTGCAGCCTCTGGTGAAGCGGGTCTGCG 
ACACTGACCGCCTGGCCTGCAGCAAGACCTGCTTGAACTCGGCTGACATTGGCTTCGTCATCGAC 
GGCTCCAGCAGTGTGGGGACGGGCAACTTCCGCACCGTCCTCCAGTTTGTGACCAACCTCACCAA 
AGAGTTTGAGATTTCCGACACGGACACGCGCATCGGGGCCGTGCAGTACACCTACGAACAGCGGC 
TGGAGTTTGGGTTCGACAAGTACAGCAGCAAGCCTGACATCCTCAACGCCATCAAGAGGGTGGGC 
TACTGGAGTGGTGGCACCAGCACGGGGGCTGCCATCAACTTCGCCCTGGAGCAGCTCTTCAAGAA 
GTCCAAGCCCAACAAGAGGAAGT TAAT GATCCTCAT CACCGACGGGAGGTCCTACGACGACGTCC 
GGATCCCAGCCATGGCTGCCCATCTGAAGGGAGTGATCACCTATGCGATAGGCGTTGCCTGGGCT 
GCCCAAGAGGAGCTAGAAG TCAT TGCCACTCACCCCGCCAGAGACCAC TCCT TCTTTGTGGACGA 
GTTTGACAACCT CCAT CAG TATGTCCCCAGGATCAT CCAGAACATTT GTACAGAGTTCAACTCAC 
AGCC T CGGAACTGAAT TCAGAGCAGGC AGAGCACCAGCAAGTGCTGCT TTAC TAACTGACGTGT T 
GGACCACCCCACCGCTTAATGGGGCACGCACGGTGCATCAAGTCTTGGGCAGGGCATGGAGAAAC 
AAAT GTCTTGTTAT TAT T C T T T G C CAT CAT GC T T T T T CAT AT T CC AAAAC T T GG AG T T AC AAAGA 
T GAT CACAAACG TATAJ3AATGAGCC AAAAGGC TACAT CAT G TT GAGGG TGCT GGAGAT TT TACAT 
TTTGACAATTGTTTTCAAAATAAATGTTCGGAATACAGTGCAGCCCTTACGACAGGCTTACGTAG
AGCTTTTGTGAGATTTTTAAGTTGTTATTTCTGATTTGAACTCTGTAACCCTCAGCAAGTTTCAT 
T T T T G T C A T G AC AAT G TAG GAAT TGCT G AAT TAAAT G T T T AGAAGGAT GAAAAATAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
FIGURE 34

MRTVVLTMKASVIEMFLVLLYTGVHSNKETAKKIKRPKFTVPQINCDVKAGKIIDPEFIVKCPAG
CQDPKYHVTGTDVYASYSSVCGAAVHSGVLDNSGGKILVRKVAGQSGYKGSYSNGVQSLSLPRWR
ESFIVLESKPKKGVTYPSALTYSSSKSPAAQAGETTKAYQRPPIPGTTAQPVTLMQLLAVTVAVA
TPTTLPRPSPSAASTTSIPRPQSVGHRSQEMDLWSTATYTSSQNRPRADPGIQRQDPSGAAFQKP
VGADVSLGLVPKEELSTQSLEPVSLGDPNCKIDLSFLIDGSTSIGKRRFRIQKQLLADVAQALDI
GPAGPLMGVVQYGDNPATHFNLKTHTNSRDLKTAIEKITQRGGLSNVGRAISFVTKNFFSKANGN
RSGAPNVVVVMVDGWPTDKVEEASRLARESGINIFFITIEGAAENEKQYVVEPNFANKAVCRTNG
FYSLHVQSWFGLHKTLQPLVKRVCDTDRLACSKTCLNSADIGFVIDGSSSVGTGNFRTVLQFVTN
LTKEFEISDTDTRIGAVQYTYEQRLEFGFDKYSSKPDILNAIKRVGYWSGGTSTGAAINFALEQL
FKKSKPNKRKLMILITDGRSYDDVRIPAMAAHLKGVITYAIGVAWAAQEELEVIATHPARDHSFF
VDEFDNLHQYVPRIIQNICTEFNSQPRN

Important features:

Signal peptide:

amino acids 1-26

Transmembrane domain:

amino acids 181-200

N-glycosylation sites.

amino acids 390-394, 520-524

N-myristoylation sites.

amino acids 23-29, 93-99, 115-121, 262-268, 367-373, 389-395,
431-437, 466-472, 509-515, 570-576, 571-577, 575-581, 627-633

Amidation site. 

amino acids 304-308 
FIGURE 35



CC GAGC AC AG GAGAT T GC CT GCGT TT AGGAGGT GG C TGCGTTGT GGGAAAAGCT ATCAAGGAAG AAAT T GC 
CAAACCATGTCTTTTTTTCTGTTTTCAGAGTAGTTCACAACAGATCTGAGTGTTTTAATTAAGCATGGAAT 
ACAGAAAACAACAAAAAACTTAAGCTTTAATTTCATCTGGAATTCCACAGTTTTCTTAGCTCCCTGGACCC 
GGTTGACCTGTTGGCTCTTCCCGCTGGCTGCTCTATCACGTGGTGCTCTCCGACTACTCACCCCGAGTGTA 
AAGAACCTTCGGCTCGCGTGCTTCTGAGCTGCTGTGGATGGCCTCGGCTCTCTGGACTGTCCTTCCGAGTA 
GGATGTCACTGAGATCCCTCAAATGGAGCCTCCTGCTGCTGTCACTCCTGAGTTTCTTTGTGATGTGGTAC 
CTCAGCCTTCCCCACTACAATGTGATAGAACGCGTGAACTGGATGTACTTCTATGAGTATGAGCCGATTTA 
CAGACAAGACTTTCACTTCACACTTCGAGAGCATTCAAACTGCTCTCATCAAAATCCATTTCTGGTCATTC
TGGTGACCTCCCACCCTTCAGATGTGAAAGCCAGGCAGGCCATTAGAGTTACTTGGGGTGAAAAAAAGTCT
TGGTGGGGATATGAGGTTCTTACATTTTTCTTATTAGGCCAAGAGGCTGAAAAGGAAGACAAAATGTTGGC
ATTGTCCTTAGAGGATGAACACCTTCTTTATGGTGACATAATCCGACAAGATTTTTTAGACACATATAATA
ACCTGACCTTGAAAACCATTATGGCATTCAGGTGGGTAACTGAGTTTTGCCCCAATGCCAAGTACGTAATG
AAGACAGACACTGATGTTTTCATCAATACTGGCAATTTAGTGAAGTATCTTTTAAACCTAAACCACTCAGA
GAAGTTTTTCACAGGTTATCCTCTAATTGATAATTATTCCTATAGAGGATTTTACCAAAAAACCCATATTT
CTTACCAGGAGTATCCTTTCAAGGTGTTCCCTCCATACTGCAGTGGGTTGGGTTATATAATGTCCAGAGAT
TTGGTGCCAAGGATCTATGAAATGATGGGTCACGTAAAACCCATCAAGTTTGAAGATGTTTATGTCGGGAT
CTGTTTGAATTTATTAAAAGTGAACATTCATATTCCAGAAGACACAAATCTTTTCTTTCTATATAGAATCC
ATTTGGATGTCTGTCAACTGAGACGTGTGATTGCAGCCCATGGCTTTTCTTCCAAGGAGATCATCACTTTT
TGGCAGGTCATGCTAAGGAACACCACATGCCATTATTAACTTCACATTCTACAAAAAGCCTAGAAGGACAG
GATACCTTGTGGAAAGTGTTAAATAAAGTAGGTACTGTGGAAAATTCATGGGGAGGTCAGTGTGCTGGCTT
ACAC TGAACT GAAACT CAT G AAAAAC CC AG ACT G G AGACT GGAGGGT TAG ACTT GT GATT TATT AGTC AGG 
CCCT TC AAAG AT G AT ATGT GGAGGAAT T AAAT AT AAAG GAAT TGG AGGT T TT T GCT AAAG AAATT AAT AGG 
ACCAAAC AATTTGGAC AT GTCATT CTGT AGACT AGAAT TT CTTAAAAGGGTGTTACTGAGTT ATAAGCTCA 
CT AGGCTGT AAAAACAAAAC AATG T AGAGTTT TAT T TAT T GAAC AATGTAGT C ACT T G AAGGT TT T GT GT A 
T ATCTT ATGTGG ATTACCAATTTAAAAAT ATATGTAGT TCTGTGTCAAAAAACTTCTTCACT GAAGTT ATA 
CT GAAC AAAATT T T AC CT GTTT TT GGTC AT TT AT AAAGTACT TCAAGAT GT T GC AGT AT T TC AC AGTT ATT 
ATTATTT AAAATT ACT TCAACTTTGTGT TTTT AAAT GTTT TGACGATT TCAATACAAG AT AAAAAGGAT AG 
T G AATC ATTCTT T ACATGC AAACATT TT CC AGTT AC T T AAC T G ATCAGT T TAT T AT TG AT AC ATC ACT CCA 
TTAATGTAAAGTCATAGGTCATTATTGCATATCAGTAATCTCTTGGACTTTGTTAAATATTTTACTGTGGT 
AAT AT AG AG AAG AAT T AAAGC AAGAAAATC TGAAAA 
FIGURE 36



MASALWTVLPSRMSLRSLKWSLLLLSLLSFFVMWYLSLPHYNVIERVNWMYFYEYEPIYRQDFHF
TLREHSNCSHQNPFLVILVTSHPSDVKARQAIRVTWGEKKSWWGYEVLTFFLLGQEAEKEDKMLA
LSLEDEHLLYGDIIRQDFLDTYNNLTLKTIMAFRWVTEFCPNAKYVMKTDTDVFINTGNLVKYLL
NLNHSEKFFTGYPLIDNYSYRGFYQKTHISYQEYPFKVFPPYCSGLGYIMSRDLVPRIYEMMGHV
KPIKFEDVYVGICLNLLKVNIHIPEDTNLFFLYRIHLDVCQLRRVIAAHGFSSKEIITFWQVMLR
NTTCHY 

Important features:

Type II transmembrane domain:

amino acids 20-39

N-glycosylation sites.

amino acids 72-76, 154-158, 198-202, 212-216, 326-330

Glycosaminoglycan attachment site.

amino acids 239-243

Ly-6 / u-PAR domain proteins.

amino acids 23-37

N-myristoylation site.

amino acids 271-277
FIGURE 37






CGCTCGGGCACCAGCCGCGGCAAGGATGGAGCTGGGTTGCTGGACGCAGTTGGGGCTCACTTTTCTTCAGCTCCTTCTCATC 
TCGTCCTTGCCAAGAGAGTACACAGTCATTAATGAAGCCTGCCCTGGAGCAGAGTGGAATATCATGTGTCGGGAGTGCTGTG 
AATAT GAT CAGATTGAGTGC GTC T GCCCC GGAAAGAGGGAAG T C GTGGGTTAT ACCATCCC TTGCTGCAGGAATGAGGAGAA 
TGAGT GT GAC TC CTGC C TGAT CC ACCC AGGT TGTACCATC TT T GAAAAC TGC AAGAGC TGCCGAAATGGCT CATGGGGGGGT 
ACCTTGGATGAC TTC T ATGTGAAGGGG T T C T AC TGTGCAGAGT GC CGAGC AGGC TGGTACGGAGGAGAC TGC ATGCGATGTG 
GCCAGGTT CTGCGAGCC CC AAAGGGTCAGATTTTGT TGGAAAGCTATCCCC T AAATGC TCACTGTGAATGGAC CATT CAT GC 
TAAACCT GGGT T TGTC ATC CAAC TAAGAT T TGTCATGTTGAGT CTGGAGT T T GACTACATGTGCCAGTATGAC TATGT T GAG 
GT T CGTGATGGAGAC AACC GC GAT GGCCAGATCAT CAAGC GTGTCTGTGGC AACGAGCGGC CAGC TCC TATCCAGAGCAT AG 
GAT CC TCAC TCC ACGTC CT CT TC C AC T C C GATGGCTCCAAGAAT T TT GACGGT TTC C ATGC CAT T TATGAGGAGATCAC AGC 
ATGCTCCTCATCCCCTTGTTTCCATGACGGCACGTGCGTCCTTGACAAGGCTGGATCTTACAAGTGTGCCTGCTTGGCAGGC 
TATAC TG GG CAGCGC TGTGAAAAT C TCC T TGAAGAAAGAAAC TGC TC AGACC C TGGGGGCC CAGTCAATGGGTAC CAGAAAA 
TAACAGGGGGCCCTGGGCTTATCAACGGACGCCATGCTAAAATTGGCACCGTGGTGTCTTTCTTTTGTAACAACTCCTATGT 
TCTTAGT GGCAAT GAGAAAAGAAC TTGCC AGCAGAATGGAGAGT GGTCAGGGAAAC AGCCCAT CTGCATAAAAGC CT GCC GA 
GAACCAAAGATT T CAGACC TGGTGAGAAGGAGAGT TCTTC CGAT GCAGG TTC AGTCAAGGGAGACACCATTACACCAGCT AT 
ACTCAGC GGCCT T CAGCAAGC AGAAAC TGCAGAGT GCCCC TAC CAAGAAGCC AGCCC T TCCCTTTGGAGAT CT GCCCATGGG 
ATACCAACATCTGCATACCCAGCTCCAGTATGAGTGCATCTCACCCTTCTACCGCCGCCTGGGCAGCAGCAGGAGGACATGT 
CTGAGGACTGGGAAGTGGAGTGGGCGGGCACCATCCTGCATCCCTATCTGCGGGAAAATTGAGAACATCACTGCTCCAAAGA
CCCAAGGGTTGCGCTGGCCGTGGCAGGCAGCCATCTACAGGAGGACCAGCGGGGTGCATGACGGCAGCCTACACAAGGGAGC
GTGGTTCCTAGTCTGCAGCGGTGCCCTGGTGAATGAGCGCACTGTGGTGGTGGCTGCCCACTGTGTTACTGACCTGGGGAAG
GTCACCATGATCAAGACAGCAGACCTGAAAGTTGTTTTGGGGAAATTCTACCGGGATGATGACCGGGATGAGAAGACCATCC
AGAGCCTACAGATTTCTGCTATCATTCTGCATCCCAACTATGACCCCATCCTGCTTGATGCTGACATCGCCATCCTGAAGCT
CCTAGACAAGGCCCGTATCAGCACCCGAGTCCAGCCCATCTGCCTCGCTGCCAGTCGGGATCTCAGCACTTCCTTCCAGGAG
TCCCACATCACTGTGGCTGGCTGGAATGTCCTGGCAGACGTGAGGAGCCCTGGCTTCAAGAACGACACACTGCGCTCTGGGG
TGGTCAGTGTGGTGGACTCGCTGCTGTGTGAGGAGCAGCATGAGGACCATGGCATCCCAGTGAGTGTCACTGATAACATGTT
C TGTGCCAGCTGGGAAC CCAC TGC CCC TT CTGATAT C TGCAC T GCAGAGACAGGAGGCATC GCGGCTGTGTCC TT C CCGGGA 
CGAGCAT C TCC TGAGCC AC GC TGGCATCTGATGGGACTGGTC AGCTGGAGC TATGAT AAAACATGCAGCCACAGGC T C TCC A 
C TG CC TT CACC AAGGTGCT GC C T T TTAAAGAC TGGAT TGAAAGAAAT AT GAAAT GAACCAT GC TCATGCAC TC CT T GAGAAG 
TGTTTCTGTATATCCGTCTGTACGTGTGTCATTGCGTGAAGCAGTGTGGGCCTGAAGTGTGATTTGGCCTGTGAACTTGGCT 
GTG CCAGGGC TTC TGAC TT C AGGGACAAAAC TCAGTGAAG G GT GAGT AGACC TCCAT TGC T GGTAGGC TGATGCC GC G TC C A 
CTACTAGGACAGCCAATTGGAAGATGCCAGGGC TTGCAAGAAGTAAGT T TCTTCAAAGAAGACCATATACAAAACCTCTCCA 
CTC CACT GACC T GGTGGTC T T CCC CAAC T TTCAGT TATAC GAATGCC AT C AGCT TGACCAGGGAAGATC TGGGC T TCATGAG 
GCCCCTTTTGAGGCTCTCAAGTTCTAGAGAGCTGCCTGTGGGACAGCCCAGGGCAGCAGAGCTGGGATGTGGTGCATGCCTT 
TGTGTACATGGCCACAGTACAGTCTGGTCCTTTTCCTTCCCCATCTCTTGTACACATTTTAATAAAATAAGGGTTGGCTTCT 
GAACTACAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAW 
FIGURE 38

MELGCWTQLGLTFLQLLLISSLPREYTVINEACPGAEWNIMCRECCEYDQIECVCPGKREVVGYT
IPCCRNEENECDSCLIHPGCTIFENCKSCRNGSWGGTLDDFYVKGFYCAECRAGWYGGDCMRCGQ
VLRAPKGQILLESYPLNAHCEWTIHAKPGFVIQLRFVMLSLEFDYMCQYDYVEVRDGDNRDGQII
KRVCGNERPAPIQSIGSSLHVLFHSDGSKNFDGFHAIYEEITACSSSPCFHDGTCVLDKAGSYKC
ACLAGYTGQRCENLLEERNCSDPGGPVNGYQKITGGPGLINGRHAKIGTVVSFFCNNSYVLSGNE
KRTCQQNGEWSGKQPICIKACREPKISDLVRRRVLPMQVQSRETPLHQLYSAAFSKQKLQSAPTK
KPALPFGDLPMGYQHLHTQLQYECISPFYRRLGSSRRTCLRTGKWSGRAPSCIPICGKIENITAP
KTQGLRWPWQAAIYRRTSGVHDGSLHKGAWFLVCSGALVNERTVVVAAHCVTDLGKVTMIKTADL
KVVLGKFYRDDDRDEKTIQSLQISAIILHPNYDPILLDADIAILKLLDKARISTRVQPICLAASR
DLSTSFQESHITVAGWNVLADVRSPGFKNDTLRSGVVSVVDSLLCEEQHEDHGIPVSVTDNMFCA
SWEPTAPSDICTAETGGIAAVSFPGRASPEPRWHLMGLVSWSYDKTCSHRLSTAFTKVLPFKDWI 
ERNMK 

Important features of the protein:
Signal peptide:

amino acids 1-23

EGF-like domain cysteine pattern signature.

amino acids 260-272

N-glycosylation sites.

amino acids 96-100, 279-283, 316-320, 451-455, 614-618

N-myristoylation sites.

amino acids 35-41, 97-103, 256-262, 284-290, 298-304, 308-314,
474-480, 491-497, 638-644, 666-672

Amidation site. 

amino acids 56-60 

Serine proteases, trypsin family. 

amino acids 489-506 

CUB domain proteins profile. 

amino acids 150-167 
FIGURE 39






GGTTCCTACATCCTCTCATCTGAGAATCAGAGAGCATAATCTTCTTACGGGCCCGTGATTTATTAACGTGGCTTAATC 
TGAAGGTTCTCAGTCAAATTCTTTGTGATCTACTGATTGTGGGGGCATGGCAAGGTTTGCTTAAAGGAGCTTGGCTGG 
TTTGGGCC CTT GTAGCT GACAGAAG GT GGC CAGG GAGAAT GCAGCAC ACT GCT CGGAG AATGA AGGCGCT T C TGT T GC 
T GGTCTTGCCTT GGCT CAGTCCTGCTAACTACATTGACAAT GT GGGCAACCTGCACTT CCT GTATTCAGAACTCT GTA 
AAGGT G CCT C CC ACTACG GCCT GAC CAAAGATAGGAAGAGGCGCT CACAAGAT GGCTGTC C AGAC G GCT GT GCGAGCC 
T CACAGCCACGGCTCCCTCCCCAGAGGTTT CTGCAGCTGCCACCATCT CCTTAATGACAGACGAGCCTGGCCTAGACA 
ACCCTGCCTACGTGTCCTCGGCAGAGGACGGGCAGCCAGCAATCAGCCCAGTGGACTCTGGCCGGAGCAACCGAACTA 
GGGCACGGCCCTTTGAGAGATCCACTATTAGAAGCAGATCATTTAAAAAAATAAATCGAGCTTTGAGTGTTCTTCGAA 
GGACAAAGAGCGGGAGT GCAGT T GCC AACCATGCC GACC AGGGC AGGGAAAATT CT GAAAACAC CACT GCCCCT GAAG 
T CTTTCCAAGGTTGTACCACCTGATT CCAGATGGT GAAATTAC CAGCAT CAAGAT CAATCGAGTAGAT CCCAGT GAAA 
GCCTCTCT ATTAG GCT G GT GGGAGGT AGC GAAAC CC CACT GGT CCATATCATTATCCAACACATTTAT CGT GATGGGG 
T GATC GCCAGAGACGGCCGGCT ACTGCCAGGAGACATCAT T CT AAAGGTCAAC GGGAT GGACAT C AGCAAT GT C C CTC 
ACAACTAC GCTGT GC GT CT CCT GCGGCAGCCCT GC CAGGT GCT GT GGCTGACT GT GAT GC GT GAACAGAAGTT CCGCA 
GCAGGAACAATGGACAGGCCCCGGAT GCCTACAGAC CCCGAGAT GAC AG CTTT CATGT GAT T CT CAACAAAAGTAGCC 
CCGAGGAGCAGCTTGGAATAAAACTGGTGCGCAAGGTGGATGAGCCTGGGGTTTTCATCTTCAATGTGCTGGATGGCG 
GT GTGGCAT AT C GACATGGT CAGCTT GAGGAGAATGACC GTGT GT TAG C C ATCAATGGACAT GAT CTT CGATAT GGCA 
GCCCAGAAAGT GCGGCTCAT CT GATT CAGGCCAGT GAAAGACGT GTT CAC CTC GT C GT GT CCCGC CAGGT T CGGC AGC 
GGAGC CCT GACAT CTT TCAGGAAGC C GG CT GGAACAGCAAT GG CAGCT GGT CC C CAGGGC CAG GGGAGAGGAGCAACA 
CTCCCAAGCCCCTCCATCCTACAATTACTTGTCATGAGAAGGTGGTAAATATCCAAAAAGACCCCGGTGAATCTCTCG 
GCAT GACCGT CGCAGGGGGAGCAT CACAT AGAGAAT GGGAT TT GCCTATCT ATGT CAT CAGT GTT GAGC C C GGAGGAG 
T CATAAGCAGAGATGGAAGAATAAAAACAGGTGACATTTT GTTGAAT GT GGAT GGGGT C GAACT GACAGAGGTCAGCC 
GGAGTGAGGCAGTGGCATTATTGAAAAGAACAT CAT CCTCGATAGTACTCAAAGCTTT GGAAGTCAAAGAGTATGAGC 
C CCAGGAAGACT GCAGCAGCC C AGCAGC C CT GGAC T C CAACCAC AACAT GGCC C CACC CAGT GACT GGT C C CCAT CCT 
G GGT CATGT GGCTGGAAT T AC CACGGTGCTT GT ATAACTGT AAAGATATT GTAT TACGAAGAAACACAGCT GGAAGTC 
T GGGCTTCTGCATTGTAGGAGGTTAT GAAGAAT ACAAT GGAAACAAAC CT TTTT T CAT CAAAT CC ATT GTT GAAGGAA 
CACCAGCATACAAT GATGGAAGAATT AGAT G TG GTGATATT CTT CTTGCT GTCAATGGTAGAAGTACATCAGGAAT GA 
T ACAT GCTT GCT TGGCAAGACT GCT GAAA.GAACT TAAAGGAAGAATTACT CTAAC TAT TGTTTCTTGGC CT GG CACTT 
TTTTATAGAATCAAT GATGGGT CAGAGGAAAACAGAAAAAT CACAAATAGGCTAAGAAGTT GAAACACTATATTTATC 
TTGTCAGTTTTTATATTTAAAGAAAGAATACATTGTAAAAATGTCAGGAAAAGTATGATCATCTAATGAAAGCCAGTT 
ACACCTCAGAAAATATGATTCCAAAAAAATTAAAACTACTAGTTTTTTTTCAGTGTGGAGGATTTCTCATTACTCTAC 
AACAT T GTT TAT AT TT TTT CTAT T CAATAAAAAGC C CTAAAAC AACT AAAATGAT T GATTT GTATACC C CACT GAATT 
CAAGCTGATT TAAAT T TAAAATT T GGTAT AT GCT GAAGT CT GC CAAGGGT ACATTAT GGC CAT T TTTAATTTACAGCT 
AAAAT ATTTTTTAAAATGCATT GCTGAGAAACGTTGCTTT CAT CAAACAAGAATAAATATTTTT CAGAAGTTAAA 
FIGURE 40






MKALLLLVLPWLSPANYIDNVGNLHFLYSELCKGASHYGLTKDRKRRSQDGCPDGCASLTATAPS 
PEVSAAATISLMTDEPGLDNPAYVSSAEDGQPAISPVDSGRSNRTRARPFERSTIRSRSFKKINR 
ALSVLRRTKSGSAVANHADQGRENSENTTAPEVFPRLYHLIPDGEITSIKINRVDPSESLSIRLV
GGSETPLVHIIIQHIYRDGVIARDGRLLPGDIILKVNGMDISNVPHNYAVRLLRQPCQVLWLTVM
REQKFRSRNNGQAPDAYRPRDDSFHVILNKSSPEEQLGIKLVRKVDEPGVFIFNVLDGGVAYRHG
QLEENDRVLAINGHDLRYGSPESAAHLIQASERRVHLVVSRQVRQRSPDIFQEAGWNSNGSWSPG
PGERSNTPKPLHPTITCHEKVVNIQKDPGESLGMTVAGGASHREWDLPIYVISVEPGGVISRDGR
IKTGDILLNVDGVELTEVSRSEAVALLKRTSSSIVLKALEVKEYEPQEDCSSPAALDSNHNMAPP 
SDWSPSWVMWLELPRCLYNCKDIVLRRNTAGSLGFCIVGGYEEYNGNKPFFIKSIVEGTPAYNDG 
RIRCGDILLAVNGRSTSGMIHACLARLLKELKGRITLTIVSWPGTFL 

Important features: 
Signal peptide: 

amino acids 1-15 

N-glycosylation sites. 

amino acids 108-112, 157-161, 289-293, 384-388

Tyrosine kinase phosphorylation sites.

amino acids 433-441, 492-500 



N-myristoylation sites. 

amino acids 51-57, 141-147, 233-239, 344-350, 423-429, 447-453, 
467-473, 603-609 
FIGURE 41



ACCAGGCAT TGTATCT TCAGTTG T CAT CAAGT T CGCAAT CAGATT GGAAAAGCTCAACTT GAAGCTTT 
CTTGCCTGCAGTGAAGCAGAGAGATAGATATTATTCACGTAATAAAAAAC ATG GGCTTCAACCTGACT 
TTCCACCTTTCCTACAAATTCCGATTACTGTTGCTGTTGACTTTGTGCCTGACAGTGGTTGGGTGGGC 
CACCAGTAACTACTTCGTGGGTGCCATTCAAGAGATTCCTAAAGCAAAGGAGTTCATGGCTAATTTCC 
ATAAGAC CCTCATTTT GGGGAAGGGAAAAACTCTGACTAATGAAGCAT CCACGAAGAAGGTAGAACTT 
GACAACTGTCCTTCTGTGTCTCCTTACCTCAGAGGCCAGAGCAAGCTCATTTTCAAACCAGATCTCAC 
TTTGGAAGAGGTACAGGCAGAAAATCCCAAAGTGTCCAGAGGCCGGTATCGCCCTCAGGAATGTAAAG 
CTTTACAGAGGGTCGCCATCCTCGTTCCCCACCGGAACAGAGAGAAACACCTGATGTACCTGCTGGAA 
CATCTGCATCCCTTCCTGCAGAGGCAGCAGCTGGATTATGGCATCTACGTCATCCACCAGGCTGAAGG
TAAAAAGTTTAATCGAGCCAAACTCTTGAATGTGGGCTATCTAGAAGCCCTCAAGGAAGAAAATTGGG
ACTGCTTTATATTCCACGATGTGGACCTGGTACCCGAGAATGACTTTAACCTTTACAAGTGTGAGGAG
CATCCCAAGCATCTGGTGGTTGGCAGGAACAGCACTGGGTACAGGTTACGTTACAGTGGATATTTTGG
GGGTGTTACTGCCCTAAGCAGAGAGCAGTTTTTCAAGGTGAATGGATTCTCTAACAACTACTGGGGAT
GGGGAGGCGAAGACGATGAC CTCAGACTCAGGGT TGAGCT CCAAAGAATGAAAATT TCC CGGCCCCTG 
CCT GAAGTGGGTAAAT ATACAATGGTCTT CCAC ACTAGAGACAAAGGCAAT GAGGT GAAC GCAGAACG 
GAT GAAGCT CT TACACCAAGTGTCACGAGTCTGGAGAACAGATGGGTTGAGTAGTT GTTC TTATAAAT 
TAG TAT C T G T G GAAC AC AAT C CT T TAT AT AT C AACAT C AC AGT G GAT T T CT GG T T T G GT G C ATGACC C 
T GGATCT T TTGGT GAT GTTT GGAAGAACT GATT CTT T GT T TGCAATAAT T T T GGCCTAGAGACT T CAA 
AT AG TAG C AC AC ATT AAGAAC CT G T TAC AGC T CAT T GTT GAGC TGAATT T TTCCTTT TT GTATT T TCT 
TAGCAGAGCTCCT GGT GATGTAGAGTATAAAACAGT T GTAACAAGACAGCTT TCTTAGTCATT T T GAT 
CAT GAGGGTTAAATAT TGTAATAT GGATACT TGAAGGACTTTATATAAAAGGATGACTCAAAGGATAA 
AAT GAAC GCTATT TGAGGAC T CT GGTT GAAGGAGATT TATTTAAAT T T GAAGTAATATAT TAT GGGAT 
AAAAG GC C ACAGGAAAT AAGACT G C T GAAT G TC T GAGAGAAC CAGAG T T GT T C TC GT CC AAG G T AGAA 
AGGTACGAAGATACAATACTGTTATTCATTTATCCTGTACAATCATCTGTGAAGTGGTGGTGTCAGGT 
GAGAAGGCGTCCACAAAAGAGGGGAGAAAAGGC GACGAAT CAGGACACAGT GAACT TGGGAATGAAGA 
GGTAGCAGGAGGGTGGAGTGTCGGCTGCAAAGGCAGCAGTAGCTGAGCTGGTTGCAGGTGCTGATAGC 
CTTCAGGGGAGGACCTGCCCAGGTATGCCTTCCAGTGATGCCCACCAGAGAATACATTCTCTATTAGT 
TTTTAAAGAGTTTTTGTAAAATGATTTTGTACAAGTAGGATATGAATTAGCAGTTTACAAGTTTACAT 
ATT AACT AATAATAAATAT GT CTATCAAATACC T CT GTAGTAAAAT GTGAAAAAGCAAAA 



FIGURE 42



MGFNLTFHLSYKFRLLLLLTLCLTVVGWATSNYFVGAIQEIPKAKEFMANFHKTLILGKGKTLTN
EASTKKVELDNCPSVSPYLRGQSKLIFKPDLTLEEVQAENPKVSRGRYRPQECKALQRVAILVPH
RNREKHLMYLLEHLHPFLQRQQLDYGIYVIHQAEGKKFNRAKLLNVGYLEALKEENWDCFIFHDV
DLVPENDFNLYKCEEHPKHLVVGRNSTGYRLRYSGYFGGVTALSREQFFKVNGFSNNYWGWGGED
DDLRLRVELQRMKISRPLPEVGKYTMVFHTRDKGNEVNAERMKLLHQVSRVWRTDGLSSCSYKLV
SVEHNPLYINITVDFWFGA



Important features:

Signal peptide:

amino acids 1-27

N-glycosylation sites.

amino acids 4-8, 220-224, 335-339

Xylose isomerase proteins. 

amino acids 191-202 
FIGURE 43



GCTCAAGACCCAGCAGTGGGACAGCCAGACAGACGGCACGATGGCACTGAGCTCCCAGATCTGGG 
CCGCTTGCCTCCTGCTCCTCCTCCTCCTCGCCAGCCTGACCAGTGGCTCTGTTTTCCCACAACAG 
ACGGGACAACTTGCAGAGCTGCAACCCCAGGACAGAGCTGGAGCCAGGGCCAGCTGGATGCCCAT 
GTTCCAGAGGCGAAGGAGGGGAGACACCCACTTCCCCATCTGCATTTTCTGCTGCGGCTGCTGTC 
ATCGATCAAAGTGTGGGATGTGCTGCAAGACGTAGAACCTACCTGCCCTGCCCCCGTCCCCTCCC 
TTCCTTATTTATTCCTGCTGCCCCAGAACATAGGTCTTGGAATAAAATGGCTGGTTCTTTTGTTT 
T CCAAAAAAAAAjWWyWiAAAAA 



AAAAAZWW\AAAAAAAAAAAAAAAAAAAA
FIGURE 44

MALSSQIWAACLLLLLLLASLTSGSVFPQQTGQLAELQPQDRAGARASWMPMFQRRRRRDTHFPI
CIFCCGCCHRSKCGMCCKT

Important features: 
Signal peptide: 

amino acids 1-24 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 58-59 

N-myristoylation site. 

amino acids 44-50

Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 1-12

FIGURE 45

GTGGCTTCATTTCAGTGGCTGACTTCCAGAGAGCAATATGGCTGGTTCCCCAACATGCCTCACCC
TCATCTATATCCTTTGGCAGCTCACAGGGTCAGCAGCCTCTGGACCCGTGAAAGAGCTGGTCGGT 
TCCGTTGGTGGGGCCGTGACTTTCCCCCTGAAGTCCAAAGTAAAGCAAGTTGACTCTATTGTCTG 
GACCTTCAACACAACCCCTCTTGTCACCATACAGCCAGAAGGGGGCACTATCATAGTGACCCAAA
ATCGTAATAGGGAGAGAGTAGACTTCCCAGATGGAGGCTACTCCCTGAAGCTCAGCAAACTGAAG
AAGAATGACTCAGGGATCTACTATGTGGGGATATACAGCTCATCACTCCAGCAGCCCTCCACCCA 
GGAGTACGTGCTGCATGTCTACGAGCACCTGTCAAAGCCTAAAGTCACCATGGGTCTGCAGAGCA 
ATAAGAATGGCACCTGTGTGACCAATCTGACATGCTGCATGGAACATGGGGAAGAGGATGTGATT
TATACCTGGAAGGCCCTGGGGCAAGCAGCCAATGAGTCCCATAATGGGTCCATCCTCCCCATCTC 
CTGGAGATGGGGAGAAAGTGATATGACCTTCATCTGCGTTGCCAGGAACCCTGTCAGCAGAAACT 
TCTCAAGCCCCATCCTTGCCAGGAAGCTCTGTGAAGGTGCTGCTGATGACCCAGATTCCTCCATG 
GTCCTCCTGTGTCTCCTGTTGGTGCCCCTCCTGCTCAGTCTCTTTGTACTGGGGCTATTTCTTTG 
GTTTCTGAAGAGAGAGAGACAAGAAGAGTACATTGAAGAGAAGAAGAGAGTGGACATTTGTCGGG
AAACTCCTAACATATGCCCCCATTCTGGAGAGAACACAGAGTACGACACAATCCCTCACACTAAT
AGAACAATCCTAAAGGAAGATCCAGCAAATACGGTTTACTCCACTGTGGAAATACCGAAAAAGAT 
GGAAAATCCCCACTCACTGCTCACGATGCCAGACACACCAAGGCTATTTGCCTATGAGAATGTTA 
TCTAGACAGCAGTGCACTCCCCTAAGTCTCTGCTCA

FIGURE 46

MAGSPTCLTLIYILWQLTGSAASGPVKELVGSVGGAVTFPLKSKVKQVDSIVWTFNTTPLVTIQP
EGGTIIVTQNRNRERVDFPDGGYSLKLSKLKKNDSGIYYVGIYSSSLQQPSTQEYVLHVYEHLSK
PKVTMGLQSNKNGTCVTNLTCCMEHGEEDVIYTWKALGQAANESHNGSILPISWRWGESDMTFIC
VARNPVSRNFSSPILARKLCEGAADDPDSSMVLLCLLLVPLLLSLFVLGLFLWFLKRERQEEYIE
EKKRVDICRETPNICPHSGENTEYDTIPHTNRTILKEDPANTVYSTVEIPKKMENPHSLLTMPDT
PRLFAYENVI

Important features:
Signal peptide:

amino acids 1-22

Transmembrane domain:

amino acids 224-250

Leucine zipper pattern.

amino acids 229-251

N-glycosylation sites.

amino acids 98-102, 142-146, 148-152, 172-176, 176-180, 204-208,
291-295

FIGURE 47



GGCTCGAGCGTTTCTGAGCGAGGGGTGACC ATGA CCTGCTGCGAAGGATGGACATCCTGCAATGG
ATTCAGCCTGCTGGTTCTACTGCTGTTAGGAGTAGTTCTCAATGCGATACCTCTAATTGTCAGCT
TAGTTGAGGAAGACCAATTTTCTCAAAACCCCATCTCTTGCTTTGAGTGGTGGTTCCCAGGAATT
ATAGGAGCAGGTCTGATGGCCATTCCAGCAACAACAATGTCCTTGACAGCAAGAAAAAGAGCGTG
CTGCAACAACAGAACTGGAATGTTTCTTTCATCATTTTTCAGTGTGATCACAGTCATTGGTGCTC
TGTATTGCATGCTGATATCCATCCAGGCTCTCTTAAAAGGTCCTCTCATGTGTAATTCTCCAAGC
AACAGTAATGCCAATTGTGAATTTTCATTGAAAAACATCAGTGACATTCATCCAGAATCCTTCAA
CTTGCAGTGGTTTTTCAATGACTCTTGTGCACCTCCTACTGGTTTCAATAAACCCACCAGTAACG
ACACCATGGCGAGTGGCTGGAGAGCATCTAGTTTCCACTTCGATTCTGAAGAAAACAAACATAGG
CTTATCCACTTCTCAGTATTTTTAGGTCTATTGCTTGTTGGAATTCTGGAGGTCCTGTTTGGGCT
CAGTCAGATAGTCATCGGTTTCCTTGGCTGTCTGTGTGGAGTCTCTAAGCGAAGAAGTCAAATTG
TGTAGTTTAATGGGAATAAAATGTAAGTATCAGTAGTTTGAAAAAAAAAAA

FIGURE 48

MTCCEGWTSCNGFSLLVLLLLGVVLNAIPLIVSLVEEDQFSQNPISCFEWWFPGIIGAGLMAIPA
TTMSLTARKRACCNNRTGMFLSSFFSVITVIGALYCMLISIQALLKGPLMCNSPSNSNANCEFSL
KNISDIHPESFNLQWFFNDSCAPPTGFNKPTSNDTMASGWRASSFHFDSEENKHRLIHFSVFLGL
LLVGILEVLFGLSQIVIGFLGCLCGVSKRRSQIV

Important features: 
Transmembrane domains: 

amino acids 10-31 (type II), 50-72, 87-110, 191-213 
N-glycosylation sites. 

amino acids 80-84, 132-136, 148-152, 163-167 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 223-227 

N-myristoylation sites. 

amino acids 22-28, 54-60, 83-89, 97-103, 216-222 



Prokaryotic membrane lipoprotein lipid attachment site.

amino acids 207-218 



TNFR/NGFR family cysteine-rich region protein. 

amino acids 4-12

FIGURE 49

ATCCGTTCTCTGCGCTGCCAGCTCAGGTGAGCCCTCGCCAAGGTGACCTCGCAGGACACTGGTGA 
AGGAGCAGTGAGGAACCTGCAGAGTCACACAGTTGCTGACCAATTGAGCTGTGAGCCTGGAGCAG 
ATCCGTGGGCTGCAGACCCCCGCCCCAGTGGCTCTCCCCCTGCAGCCCTGCCCCTCGAACTGTGA 
CATGGAGAGAGTGACCCTGGCGCTTCTCCTACTGGCAGGCCTGACTGCCTTGGAAGCCAATGACC 
CATTTGCCAATAAAGACGATCCCTTCTACTATGACTGGAAAAACCTGCAGCTGAGCGGACTGATC 
TGCGGAGGGCTCCTGGCCATTGCTGGGATCGCGGCAGTTCTGAGTGGCAAATGCAAATACAAGAG 
CAGCCAGAAGCAGCACAGTCCTGTACCTGAGAAGGCCATCCCACTCATCACTCCAGGCTCTGCCA 
CTACTTGCTGAGCACAGGACTGGCCTCCAGGGATGGCCTGAAGCCTAACACTGGCCCCCAGCACC 
TCCTCCCCTGGGAGGCCTTATCCTCAAGGAAGGACTTCTCTCCAAGGGCAGGCTGTTAGGCCCCT
TTCTGATCAGGAGGCTTCTTTATGAATTAAACTCGCCCCACCACCCCCTCA

FIGURE 50



MERVTLALLLLAGLTALEANDPFANKDDPFYYDWKNLQLSGLICGGLLAIAGIAAVLSGKCKYKS 
SQKQHSPVPEKAIPLITPGSATTC

Important features: 
Signal peptide: 

amino acids 1-16 



Transmembrane domain: 




amino acids 36-59 



N-myristoylation sites.

amino acids 41-47, 45-51, 84-90

Extracellular proteins SCP/Tpx-1/Ag5/PR-1/Sc7.

amino acids 54-67

App_ID=10063602

Page 200 of 318

FIGURE 51



GTGGACTCTGAGAAGCCCAGGCAGTTGAGGACAGGAGAGAGAAGGCTGCAGACCCAGAGGGAGGG 
AGGACAGGGAGTCGGAAGGAGGAGGACAGAGGAGGGCACAGAGACGCAGAGCAAGGGCGGCAAGG
AGGAGACCCTGGTGGGAGGAAGACACTCTGGAGAGAGAGGGGGCTGGGCAGAGATGAAGTTCCAG 
GGGCCCCTGGCCTGCCTCCTGCTGGCCCTCTGCCTGGGCAGTGGGGAGGCTGGCCCCCTGCAGAG 
CGGAGAGGAAAGCACTGGGACAAATATTGGGGAGGCCCTTGGACATGGCCTGGGAGACGCCCTGA 
GCGAAGGGGTGGGAAAGGCCATTGGCAAAGAGGCCGGAGGGGCAGCTGGCTCTAAAGTCAGTGAG 
GCCCTTGGCCAAGGGACCAGAGAAGCAGTTGGCACTGGAGTCAGGCAGGTTCCAGGCTTTGGCGC 
AGCAGATGCTTTGGGCAACAGGGTCGGGGAAGCAGCCCATGCTCTGGGAAACACTGGGCACGAGA 
TTGGCAGACAGGCAGAAGATGTCATTCGACACGGAGCAGATGCTGTCCGCGGCTCCTGGCAGGGG 
GTGCCTGGCCACAGTGGTGCTTGGGAAACTTCTGGAGGCCATGGCATCTTTGGCTCTCAAGGTGG 
CCTTGGAGGCCAGGGCCAGGGCAATCCTGGAGGTCTGGGGACTCCGTGGGTCCACGGATACCCCG
GAAACTCAGCAGGCAGCTTTGGAATGAATCCTCAGGGAGCTCCCTGGGGTCAAGGAGGCAATGGA
GGGCCACCAAACTTTGGGACCAACACTCAGGGAGCTGTGGCCCAGCCTGGCTATGGTTCAGTGAG
AGCCAGCAACCAGAATGAAGGGTGCACGAATCCCCCACCATCTGGCTCAGGTGGAGGCTCCAGCA 
ACTCTGGGGGAGGCAGCGGCTCACAGTCGGGCAGCAGTGGCAGTGGCAGCAATGGTGACAACAAC 
AATGGCAGCAGCAGTGGTGGCAGCAGCAGTGGCAGCAGCAGTGGCAGCAGCAGTGGCGGCAGCAG 
TGGCGGCAGCAGTGGTGGCAGCAGTGGCAACAGTGGTGGCAGCAGAGGTGACAGCGGCAGTGAGT
CCTCCTGGGGATCCAGCACCGGCTCCTCCTCCGGCAACCACGGTGGGAGCGGCGGAGGAAATGGA 
CATAAACCCGGGTGTGAAAAGCCAGGGAATGAAGCCCGCGGGAGCGGGGAATCTGGGATTCAGGG 
CTTCAGAGGACAGGGAGTTTCCAGCAACATGAGGGAAATAAGCAAAGAGGGCAATCGCCTCCTTG 
GAGGCTCTGGAGACAATTATCGGGGGCAAGGGTCGAGCTGGGGCAGTGGAGGAGGTGACGCTGTT 
GGTGGAGTCAATACTGTGAACTCTGAGACGTCTCCTGGGATGTTTAACTTTGACACTTTCTGGAA 
GAATTTTAAATCCAAGCTGGGTTTCATCAACTGGGATGCCATAAACAAGGACCAGAGAAGCTCTC 
GCATCCCGTGACCTCCAGACAAGGAGCCACCAGATTGGATGGGAGCCCCCACACTCCCTCCTTAA
AACACCACCCTCTCATCACTAATCTCAGCCCTTGCCCTTGAAATAAACCTTAGCTGCCCCACAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAZW
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

FIGURE 52

MKFQGPLACLLLALCLGSGEAGPLQSGEESTGTNIGEALGHGLGDALSEGVGKAIGKEAGGAAGSKVS
EALGQGTREAVGTGVRQVPGFGAADALGNRVGEAAHALGNTGHEIGRQAEDVIRHGADAVRGSWQGVP
GHSGAWETSGGHGIFGSQGGLGGQGQGNPGGLGTPWVHGYPGNSAGSFGMNPQGAPWGQGGNGGPPNF
GTNTQGAVAQPGYGSVRASNQNEGCTNPPPSGSGGGSSNSGGGSGSQSGSSGSGSNGDNNNGSSSGGS
SSGSSSGSSSGGSSGGSSGGSSGNSGGSRGDSGSESSWGSSTGSSSGNHGGSGGGNGHKPGCEKPGNE
ARGSGESGIQGFRGQGVSSNMREISKEGNRLLGGSGDNYRGQGSSWGSGGGDAVGGVNTVNSETSPGM
FNFDTFWKNFKSKLGFINWDAINKDQRSSRIP

Signal peptide: 

amino acids 1-21 



N-glycosylation site.

amino acids 265-269



Glycosaminoglycan attachment site. 

amino acids 235-239, 237-241, 244-248, 255-259, 324-328, 388-392 

Casein kinase II phosphorylation site. 

amino acids 26-30, 109-113, 259-263, 300-304, 304-308 



N-myristoylation site.

amino acids 17-23, 32-38, 42-48, 50-56, 60-66, 61-67, 64-70, 74-80, 

90-96, 96-102, 130-136, 140-146, 149-155, 152-158, 155-161, 

159-165, 163-169, 178-184, 190-196, 194-200, 199-205, 218-224, 

236-242, 238-244, 239-245, 240-246, 245-251, 246-252, 249-252, 

253-259, 256-262, 266-272, 270-276, 271-277, 275-281, 279-285, 

283-289, 284-290, 287-293, 288-294, 291-297, 292-298, 295-301, 

298-304, 305-311, 311-317, 315-321, 319-325, 322-328, 323-329, 

325-331, 343-349, 354-360, 356-362, 374-380, 381-387, 383-389, 
387-393, 389-395, 395-401 

Cell attachment sequence. 

amino acids 301-304

FIGURE 53



GGAGAAGAGGTTGTGTGGGACAAGCTGCTCCCGACAGAAGG ATG TCGCTGCTGAGCCTGCCCTGG 
CTGGGCCTCAGACCGGTGGCAATGTCCCCATGGCTACTCCTGCTGCTGGTTGTGGGCTCCTGGCT 
ACTCGCCCGCATCCTGGCTTGGACCTATGCCTTCTATAACAACTGCCGCCGGCTCCAGTGTTTCC 
CACAGCCCCCAAAACGGAACTGGTTTTGGGGTCACCTGGGCCTGATCACTCCTACAGAGGAGGGC 
TTGAAGGACTCGACCCAGATGTCGGCCACCTATTCCCAGGGCTTTACGGTATGGCTGGGTCCCAT 
CATCCCCTTCATCGTTTTATGCCACCCTGACACCATCCGGTCTATCACCAATGCCTCAGCTGCCA 
TTGCACCCAAGGATAATCTCTTCATCAGGTTCCTGAAGCCCTGGCTGGGAGAAGGGATACTGCTG 
AGTGGCGGTGACAAGTGGAGCCGCCACCGTCGGATGCTGACGCCCGCCTTCCATTTCAACATCCT
GAAGTCGTATATAACGATGTTCAACAAGAGTGCAAACATCATGCTTGACAAGTGGCAGCACCTGG
CCTCAGAGGGCAGCAGTCGTCTGGACATGTTTGAGCACATCAGCCTCATGACCTTGGACAGTCTA
CAGAAATGCATCTTCAGCTTTGACAGCCATTGTCAGGAGAGGCCCAGTGAATATATTGCCACCAT
CTTGGAGCTCAGTGCCCTTGTAGAGAAAAGAAGCCAGCATATCCTCCAGCACATGGACTTTCTGT
ATTACCTCTCCCATGACGGGCGGCGCTTCCACAGGGCCTGCCGCCTGGTGCATGACTTCACAGAC 
GCTGTCATCCGGGAGCGGCGTCGCACCCTCCCCACTCAGGGTATTGATGATTTTTTCAAAGACAA 
AGCCAAGTCCAAGACTTTGGATTTCATTGATGTGCTTCTGCTGAGCAAGGATGAAGATGGGAAGG 
CATTGTCAGATGAGGATATAAGAGCAGAGGCTGACACCTTCATGTTTGGAGGCCATGACACCACG 
GCCAGTGGCCTCTCCTGGGTCCTGTACAACCTTGCGAGGCACCCAGAATACCAGGAGCGCTGCCG 
ACAGGAGGTGCAAGAGCTTCTGAAGGACCGCGATCCTAAAGAGATTGAATGGGACGACCTGGCCC
AGCTGCCCTTCCTGACCATGTGCGTGAAGGAGAGCCTGAGGTTACATCCCCCAGCTCCCTTCATC
TCCCGATGCTGCACCCAGGACATTGTTCTCCCAGATGGCCGAGTCATCCCCAAAGGCATTACCTG 
CCTCATCGATATTATAGGGGTCCATCACAACCCAACTGTGTGGCCGGATCCTGAGGTCTACGACC 
CCTTCCGCTTTGACCCAGAGAACAGCAAGGGGAGGTCACCTCTGGCTTTTATTCCTTTCTCCGCA
GGGCCCAGGAACTGCATCGGGCAGGCGTTCGCCATGGCGGAGATGAAAGTGGTCCTGGCGTTGAT 
GCTGCTGCACTTCCGGTTCCTGCCAGACCACACTGAGCCCCGCAGGAAGCTGGAATTGATCATGC 
GCGCCGAGGGCGGGCTTTGGCTGCGGGTGGAGCCCCTGAATGTAGGCTTGCAGTGACTTTCTGAC 
CCATCCACCTGTTTTTTTGCAGATTGTCATGAATAAAACGGTGCTGTCAAA

FIGURE 54

MSLLSLPWLGLRPVAMSPWLLLLLVVGSWLLARILAWTYAFYNNCRRLQCFPQPPKRNWFWGHLG

LITPTEEGLKDSTQMSATYSQGFTVWLGPIIPFIVLCHPDTIRSITNASAAIAPKDNLFIRFLKP 
WLGEGILLSGGDKWSRHRRMLTPAFHFNILKSYITIFNKSANIMLDKWQHLASEGSSRLDMFEHI 
SLMTLDSLQKCIFSFDSHCQERPSEYIATILELSALVEKRSQHILQHMDFLYYLSHDGRRFHRAC 
RLVHDFTDAVIRERRRTLPTQGIDDFFKDKAKSKTLDFIDVLLLSKDEDGKALSDEDIRAEADTF 
MFGGHDTTASGLSWVLYNLARHPEYQERCRQEVQELLKDRDPKEIEWDDLAQLPFLTMCVKESLR 
LHPPAPFISRCCTQDIVLPDGRVIPKGITCLIDIIGVHHNPTVWPDPEVYDPFRFDPENSKGRSP
LAFIPFSAGPRNCIGQAFAMAEMKVVLALMLLHFRFLPDHTEPRRKLELIMRAEGGLWLRVEPLN
VGLQ 

Important features:
Transmembrane domains:

amino acids 13-32 (type II), 77-102

Cytochrome P450 cysteine heme-iron ligand signature.

amino acids 461-471

N-glycosylation sites.

amino acids 112-116, 168-172

FIGURE 55



ATCGCATCAATTGGGAGTACCATCTTCCTC ATGG GACCAGTGAAACAGCTGAAGCGAATGTTTGA 
GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT 
TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG 
TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC 
ATAATTCATGGCCAGTTTTATGAAGCTTTGGAAGGCACTATGGACAGAAGCTGGTGGACAGTTTT
GTAACTATCTTCGAAACCTCTGTCTTACAGACATGTGCCTTTTATCTTGCAGCAATGTGTTGCTT 
GTGATTCGAACATTTGAGGGTTACTTTTGGAAGCAACAATACATTCTCGAACCTGAATGTCAGTA 
GCACAGGATGAGAAGTGGGTTCTGTATCTTGTGGAGTGGAATCTTCCTCATGTACCTGTTTCCTC
TCTGGATGTTGTCCCACTGAATTCCCATGAATACAAACCTATTCAGCAACAGCAAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

FIGURE 56



MGPVKQLKRMFEPTRLIATIMVLLCFALTLCSAFWWHNKGLALIFCILQSLALTWYSLSFIPFAR
DAVKKCFAVCLA

Important features:
Signal peptide:

amino acids 1-33

Type II fibronectin collagen-binding domain protein.

amino acids 30-72
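
Figures 55 and 56 pair a nucleotide sequence with its deduced amino acid sequence. As a minimal sketch of how such a pairing can be checked, assuming the standard genetic code and assuming the coding region begins at the ATG that is set off by spaces in Figure 55, the Python below translates the open reading frame up to the first in-frame stop codon. The names `translate_orf`, `CODON_TABLE` and `FIG55_CODING` are illustrative and do not come from the source.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code with codons enumerated in TCAG order; "*" marks a stop.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate_orf(dna):
    """Translate from the first ATG to the first in-frame stop codon."""
    dna = "".join(dna.split()).upper()  # drop spacing left over from extraction
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" for unreadable codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Coding portion of Figure 55, from the marked ATG through the stop codon.
FIG55_CODING = (
    "ATGGGACCAGTGAAACAGCTGAAGCGAATGTTTGA"
    "GCCTACTCGTTTGATTGCAACTATCATGGTGCTGTTGTGTTTTGCACTTACCCTGTGTTCTGCCT"
    "TTTGGTGGCATAACAAGGGACTTGCACTTATCTTCTGCATTTTGCAGTCTTTGGCATTGACGTGG"
    "TACAGCCTTTCCTTCATACCATTTGCAAGGGATGCTGTGAAGAAGTGTTTTGCCGTGTGTCTTGC"
    "ATAA"
)

print(translate_orf(FIG55_CODING))
```

Run on the Figure 55 coding region, the sketch yields a 77-residue sequence that can be compared line by line with Figure 56. The same check can be applied to the other figure pairs to locate residues that were damaged during text extraction.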
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FIGURE 57 



CG6CTCGAGCTCGAGCCGAATCGGCTCGAGGGGCAGTGGAGCACCCAGCAGGCCGCCAAC ATG CTCTGTCTGTGCCTG 
TACGTGCCGGTCATCGGGGAAGCCCAGACCGAGTTCCAGTACTTTGAGTCGAAGGGGCTCCCTGCCGAGCTGAAGTCC 
ATTTT CAAGCTCAGT GTCTTCAT CCCCTCCCAGGAATTCT CCACCTACCGCCAGTGGAAGCAGAAAATT GTACAAGCT 
G GAGATAAGGACCTT GATGGGCAGCTAGACT TT GAAGAAT TT GTC CATTAT CT C C AAGAT CAT GAGAAGAAGCTGAGG 
CT GGT GTT TAAGATT T T GGACAAAAAGAAT GAT GGAC GCATTGAC G C G C AG GAGAT CAT GCAGT C C CT GCGGGACTT G 
GGAGTCAAGATATCT GAACAGCAGGCAGAAAAAATTCTCAAGAGCAT GGATAAAAACGGCACGATGACCAT CGACTGG 
AACGAGT GGAGAGACT AC CACCT CCT CCAC CC C GTG GAAAACAT C C C C GAGAT CATCCT CTACT GGAAGCAT T CCACG 
AT CTTTGATGTGGGT GAGAAT CT AAC GGTCCCGGAT GAGTT CACAGT GGAGGAGAGGCAGACGGGGAT GTGGTGGAGA 
CACCTGGT GGCAGGAGGTGGGGCAGGGGCCGTATCCAGAACCT GCACGGCCCCCCTGGACAGGCT CAAGGT GCTCATG 
CAGGTCCATGCCTCCCGCAGCAACAACATGGGCATCGTTGGTGGCTTCACTCAGATGATTCGAGAAGGAGGGGCCAGG 
TCACTCTGGCGGGGCAATGGCATCAACGTCCTCAAAATTGCCCCCGAATCAGCCATCAAATTCATGGCCTATGAGCAG 
ATCAAGCGCCTTGTTGGTAGTGACCAGGAGACTCTGAGGATTCACGAGAGGCTTGTGGCAGGGTCCTTGGCAGGGGCC 
ATCGCCCAGAGCAGCAT CT AC CC AAT GGAGGT C CT GAAGAC C C GGAT GGC GCTGCGGAAGACAGGC CAGTACT CAGGA 
AT GCT GGACTGCGCCAGGAGGATCCT GGCCAGAGAGGGGGT GGCCGCCTT CTACAAAGGCTATGT CCCCAACATGCTG 
GGCAT CAT CCC CTAT GCC GGCAT CGACCTT GCAGTCT ACGAGACGCT CAAGAAT GC CT GG CT GCAG CACTATGCAGT G 
AACAGCGCGGACCCCGGCGTGTTTGTGCTCCTGGCCTGTGGCACCATGTCCAGTACCTGTGGCCAGCTGGCCAGCTAC 
CCCCTGGCCCTAGTCAGGACCCGGATGCAGGCGCAAGCCTCTATTGAGGGCGCTCCGGAGGTGACCATGAGCAGCCTC 
TTCAAACATATCCTGCGGACCGAGGGGGCCTTCGGGCTGTACAGGGGGCTGGCCCCCAACTTCATGAAGGTCATCCCA 
GCT GT GAGCAT CAGCTACGTGGT CTACGAGAACCT GAAGAT CACCCTGGG C GT GCAGT CG CGGTGACGGG GGGAG GGC 
CGCCCGGCAGTGGACTCGCTGATCCTGGGCCGCAGCCTGGGGTGTGCAGCCATCTCATTCTGTGAATGTGCCAACACT 
AAGCTGTCTCGAGCCAAGCTGTGAAAACCCTAGACGCACCCGCAGGGAGGGTGGGGAGAGCTGGCAGGCCCAGGGCTT 
GTCCTGCTGACCCCAGCAGACCCTCCTGTTGGTTCCAGCGAAGACCACAGGCATTCCTTAGGGTCCAGGGTCAGCAGG 
CTCCGGGCTCACATGT GTAAGGACAGGACATTTT CT GCAGT GCCTGCCAATAGT GAGCTTGGAGCCTGGAGGCCGGCT 
TAGTTCTTCCATTTCACCCTTGCAGCCAGCTGTTGGCCACGGCCCCTGCCCTCTGGTCTGCCGTGCATCTCCCTGTGC 
CCTCTTGCTGCCTGCCTGTCTGCTGAGGTAAGGTGGGAGGAGGGCTACAGCCCACATCCCACCCCCTCGTCCAATCCC 
ATAATCCATGATGAAAGGTGAGGTCACGTGGCCTCCCAGGCCTGACTTCCCAACCTACAGCATTGACGCCAACTTGGC
TGTGAAGGAAGAGGAAAGGATCTGGCCTTGTGGTCACTGGCATCTGAGCCCTGCTGATGGCTGGGGCTCTCGGGCATG
CTTGGGAGTGCAGGGGGCTCGGGCTGCCTGGCCTGGCTGCACAGAAGGCAAGTGCTGGGGCTCATGGTGCTCTGAGCT
GGCCTGGACCCTGTCAGGATGGGCCCCACCTCAGAACCAAACTCACTGTCCCCACTGTGGCATGAGGGCAGTGGAGCA
CCATGTTTGAGGGCGAAGGGCAGAGCGTTTGTGTGTTCTGGGGAGGGAAGGAAAAGGTGTTGGAGGCCTTAATTATGG
ACTGTTGGGAAAAGGGTTTTGTCCAGAAGGACAAGCCGGACAAATGAGCGACTTCTGTGCTTCCAGAGGAAGACGAGG
GAGCAGGAGCTTGGCTGACTGCTCAGAGTCTGTTCTGACGCCCTGGGGGTTCCTGTCCAACCCCAGCAGGGGCGCAGC
GGGACCAGCCCCACATTCCACTTGTGTCACTGCTTGGAACCTATTTATTTTGTATTTATTTGAACAGAGTTATGTCCT
AACTATTTTTATAGATTTGTTTAATTAATAGCTTGTCATTTTCAAGTTCATTTTTTATTCATATTTATGTTCATGGTT
GATTGTACCTTCCCAAGCCCGCCCAGTGGGATGGGAGGAGGAGGAGAAGGGGGGCCTT GGGCCGCTGCAGT CACAT CT 
GTCCAGAGAAATTCCTTTTGGGACTGGAGGCAGAAAAGCGGCCAGAAGGCAGCAGCCCTGGCTCCTTTCCTTTGGCAG 
GT T GGG GAAGGG CTT GCC C CCAGCCTTAG GATT TCAGG GT TTGACT GGG GGCGT G GAGAGAGAGGGAGGAACCT CAAT 
AACCTTGAAGGTGGAATCCAGTTATTTCCTGCGCTGCGAGGGTTTCTTTATTTCACTCTTTTCTGAATGTCAAGGCAG 
TGAGGTGCCTCTCACTGTGAATTTGTGGTGGGCGGGGGCTGGAGGAGAGGGTGGGGGGCTGGCTCCGTCCCTCCCAGC 
CTTCTGCTGCCCTTGCTTAACAATGCCGGCCAACTGGCGACCTCACGGTTGCACTTCCATTCCACCAGAATGACCTGA 
T GAGGAAATCTT CAATAGGATGCAAAGAT CAAT GCAAAAATTGTTATATAT GAACATATAACTGGAGT C GT CAAAAAG 
CAAATTAAGAAAGAAT T GGACGT TAGAAGT T GT CATT TAAAGC AGC CT T CT AAT AAAGTT GTTT CAAAGCT GAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 58 

MLCLCLYVPVIGEAQTEFQYFESKGLPAELKSIFKLSVFIPSQEFSTYRQWKQKIVQAGDKDLDG 
QLDFEEFVHYLQDHEKKLRLVFKILDKKNDGRIDAQEIMQSLRDLGVKISEQQAEKILKSMDKNG 
TMTIDWNEWRDYHLLHPVENIPEIILYWKHSTIFDVGENLTVPDEFTVEERQTGMWWRHLVAGGG
AGAVSRTCTAPLDRLKVLMQVHASRSNNMGIVGGFTQMIREGGARSLWRGNGINVLKIAPESAIK
FMAYEQIKRLVGSDQETLRIHERLVAGSLAGAIAQSSIYPMEVLKTRMALRKTGQYSGMLDCARR
ILAREGVAAFYKGYVPNMLGIIPYAGIDLAVYETLKNAWLQHYAVNSADPGVFVLLACGTMSSTC
GQLASYPLALVRTRMQAQASIEGAPEVTMSSLFKHILRTEGAFGLYRGLAPNFMKVIPAVSISYV
VYENLKITLGVQSR

Important features: 
Signal peptide: 

amino acids 1-16 

Putative transmembrane domains: 

amino acids 284-304, 339-360, 376-394 

Mitochondrial energy transfer proteins signature. 

amino acids 206-215, 300-309 

N-glycosylation sites. 

amino acids 129-133, 169-173 

Elongation Factor-hand calcium-binding protein. 

amino acids 54-73, 85-104, 121-140

FIGURE 59

GGAAGGCAGCGGCAGCTCCACTCAGCCAGTACCCAGATACGCTGGGAACCTTCCCCAGCCATGGC
TTCCCTGGGGCAGATCCTCTTCTGGAGCATAATTAGCATCATCATTATTCTGGCTGGAGCAATTG
CACTCATCATTGGCTTTGGTATTTCAGGGAGACACTCCATCACAGTCACTACTGTCGCCTCAGCT 
GGGAACATTGGGGAGGATGGAATCCTGAGCTGCACTTTTGAACCTGACATCAAACTTTCTGATAT 
CGT GATACAATGGCTGAAGGAAGGT GTTT TAGGCTT GGTCCATGAGTTCAAAGAAGGCAAAGAT G 
AGCTGTCGGAGCAGGATGAAATGTTCAGAGGCCGGACAGCAGTGTTTGCTGATCAAGTGATAGTT 
GGCAATGCCTCTTTGCGGCTGAAAAACGTGCAACTCACAGATGCTGGCACCTACAAATGTTATAT 
CAT CACT TCTAAAGGCAAGGGGAAT GC TAACC T T GAG TAT AAAAC TGGAGC CTTCAGCAT GC CGG 
AAGTGAATGTGGACTATAATGCCAGCTCAGAGACCTTGCGGTGTGAGGCTCCCCGATGGTTCCCC 
CAGCCCACAGTGGTCTGGGCATCCCAAGTTGACCAGGGAGCCAACTTCTCGGAAGTCTCCAATAC 
CAGCTTTGAGCTGAACTCTGAGAATGTGACCATGAAGGTTGTGTCTGTGCTCTACAATGTTACGA 
T CAACAACACATACT CCT GTATGATTGAAAATGACAT T GCCAAAGCAACAG GGGATAT CAAAGT G 
ACAGAATCGGAGATCAAAAGGCGGAGTCACCTACAGC T GC TAAAC TC AAAGGCT T C T C T GTGT GT 
CTCTTCTTTCTTTGCCATCAGCTGGGCACTTCTGCCTCTCAGCCCTTACCTGATGCTAAAATAAT 
GT GCC T TGGCCACAAAAAAGCAT GCAAAGTCAT TGTTACAACAGGGAT CT ACAGAACTATT T CAC 
CACCAGATATGACCTAGTTTTATATTTCTGGGAGGAAATGAATTCATATCTAGAAGTCTGGAGTG 
AGCAAACAAGAGCAAGAAACAAAAAGAAGCCAAAAGCAGAAGGCTCCAATATGAACAAGATAAAT 
C T AT C T T C AAAG AC AT AT T AG AAG T T GG GAAAAT AAT T CAT G T GAAC T AGAC AAG T G T G T T AAG A 
GTGATAAGTAAAATGCACGTGGAGACAAGTGCATCCCCAGATCTCAGGGACCTCCCCCTGCCTGT 
CACCTGGGGAGTGAGAGGACAGGATAGTGCATGTTCTTTGTCTCTGAATTTTTAGTTATATGTGC 
T GT AAT GTTGCTCT G AGGAAGCC CCT GGAAAG T C TAT C CC AACAT AT CC ACATCT TAT AT T C CAC 
AAATTAAGCTGTAGTATGTACCCTAAGACGCTGCTAATTGACTGCCACTTCGCAACTCAGGGGCG 
GCTGCATTTTAGTAATGGGTCAAATGATTCACTTTTTATGATGCTTCCAAAGGTGCCTTGGCTTC 
T C T T C C C AAC T GAC AAAT G CC AAAGT T GAGAAAAAT GAT CAT AAT T T TAG C AT AAAC AGAGC AGT 
CGGGGACACCGATT TTAT AAATAAACT GAGCACCT T CTT T T TAAACAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGURE 60 

MASLGQILFWSIISIIIILAGAIALIIGFGISGRHSITVTTVASAGNIGEDGILSCTFEPDIKLS
DIVIQWLKEGVLGLVHEFKEGKDELSEQDEMFRGRTAVFADQVIVGNASLRLKNVQLTDAGTYKC
YIITSKGKGNANLEYKTGAFSMPEVNVDYNASSETLRCEAPRWFPQPTVVWASQVDQGANFSEVS
NTSFELNSENVTMKVVSVLYNVTINNTYSCMIENDIAKATGDIKVTESEIKRRSHLQLLNSKASL
CVSSFFAISWALLPLSPYLMLK



Important features: 
Signal peptide: 

amino acids 1-28



Transmembrane domain: 

amino acids 258-281 

N-glycosylation sites. 

amino acids 112-116, 160-164, 190-194, 196-200, 205-209, 216-220, 
220-224 

N-myristoylation sites. 

amino acids 52-58, 126-132, 188-194 
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FIGURE 61 

TGACGTCAGAATGACCATGGCCAGCTATCCTTACCGGCAGGGCTGCCCAGGAGCTGCAGGACAAG 
CACCAGGAGCCCCTCCGGGTAGCTACTACCCTGGACCCCCCAATAGTGGAGGGCAGTATGGTAGT 
GGGCTACCCCCTGGTGGTGGTTATGGGGGTCCTGCCCCTGGAGGGCCTTATGGACCACCAGCTGG 
TGGAGGGCCCTATGGACACCCCAATCCTGGGATGTTCCCCTCTGGAACTCCAGGAGGACCATATG 
GCGGTGCAGCTCCCGGGGGCCCCTATGGTCAGCCACCTCCAAGTTCCTACGGTGCCCAGCAGCCT 
GGGCTTTATGGACAGGGTGGCGCCCCTCCCAATGTGGATCCTGAGGCCTACTCCTGGTTCCAGTC 
GGTGGACTCAGATCACAGTGGCTATATCTCCATGAAGGAGCTAAAGCAGGCCCTGGTCAACTGCA
ATTGGTCTTCATTCAATGATGAGACCTGCCTCATGATGATAAACATGTTTGACAAGACCAAGTCA
GGCCGCATCGATGTCTACGGCTTCTCAGCCCTGTGGAAATTCATCCAGCAGTGGAAGAACCTCTT
CCAGCAGTATGACCGGGACCGCTCGGGCTCCATTAGCTACACAGAGCTGCAGCAAGCTCTGTCCC
AAATGGGCTACAACCTGAGCCCCCAGTTCACCCAGCTTCTGGTCTCCCGCTACTGCCCACGCTCT
GCCAATCCTGCCATGCAGCTTGACCGCTTCATCCAGGTGTGCACCCAGCTGCAGGTGCTGACAGA
GGCCTTCCGGGAGAAGGACACAGCTGTACAAGGCAACATCCGGCTCAGCTTCGAGGACTTCGTCA
CCATGACAGCTTCTCGGATGCTATGACCCAACCATCTGTGGAGAGTGGAGTGCACCAGGGACCTT
TCCTGGCTTCTTAGAGTGAGAGAAGTATGTGGACATCTCTTCTTTTCCTGTCCCTCTAGAAGAAC
ATTCTCCCTTGCTTGATGCAACACTGTTCCAAAAGAGGGTGGAGAGTCCTGCATCATAGCCACCA
AATAGTGAGGACCGGGGCTGAGGCCACACAGATAGGGGCCTGATGGAGGAGAGGATAGAAGTTGA
ATGTCCTGATGGCCATGAGCAGTTGAGTGGCACAGCCTGGCACCAGGAGCAGGTCCTTGTAATGG
AGTTAGTGTCCAGTCAGCTGAGCTCCACCCTGATGCCAGTGGTGAGTGTTCATCGGCCTGTTACC 
GTTAGTACCTGTGTTCCCTCACCAGGCCATCCTGTCAAACGAGCCCATTTTCTCCAAAGTGGAAT 
CTGACCAAGCATGAGAGAGATCTGTCTATGGGACCAGTGGCTTGGATTCTGCCACACCCATAAAT 
CCTTGTGTGTTAACTTCTAGCTGCCTGGGGCTGGCCCTGCTCAGACAAATCTGCTCCCTGGGCAT 
CTTTGGCCAGGCTTCTGCCCCCTGCAGCTGGGACCCCTCACTTGCCTGCCATGCTCTGCTCGGCT 
TCAGTCTCCAGGAGACAGTGGTCACCTCTCCCTGCCAATACTTTTTTTAATTTGCATTTTTTTTC 
ATTTGGGGCCAAAAGTCCAGTGAAATTGTAAGCTTCAATAAAAGGATGAAACTCTGA
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FIGURE 62 

MASYPYRQGCPGAAGQAPGAPPGSYYPGPPNSGGQYGSGLPPGGGYGGPAPGGPYGPPAGGGPYG 
HPNPGMFPSGTPGGPYGGAAPGGPYGQPPPSSYGAQQPGLYGQGGAPPNVDPEAYSWFQSVDSDH 
SGYISMKELKQALVNCNWSSFNDETCLMMINMFDKTKSGRIDVYGFSALWKFIQQWKNLFQQYDR
DRSGSISYTELQQALSQMGYNLSPQFTQLLVSRYCPRSANPAMQLDRFIQVCTQLQVLTEAFREK
DTAVQGNIRLSFEDFVTMTASRML



Important features of the protein:
Signal peptide:

amino acids 1-19

N-glycosylation site.

amino acids 147-150

Casein kinase II phosphorylation sites.

amino acids 135-138, 150-153, 202-205, 271-274

N-myristoylation sites.

amino acids 9-14, 15-20, 19-24, 33-38, 34-39, 39-44, 43-48, 61-66,
70-75, 78-83, 83-88, 87-92, 110-115

FIGURE 63



CAGGATGCAGGGCCGCGTGGCAGGGAGCTGCGCTCCTCTGGGCCTGCTCCTGGTCTGTCTTCATC 
TCCCAGGCCTCTTTGCCCGGAGCATCGGTGTTGTGGAGGAGAAAGTTTCCCAAAACTTCGGGACC 
AACTTGCCTCAGCTCGGACAACCTTCCTCCACTGGCCCCTCTAACTCTGAACATCCGCAGCCCGC
TCTGGACCCTAGGTCTAATGACTTGGCAAGGGTTCCTCTGAAGCTCAGCGTGCCTCCATCAGATG 
GCTTCCCACCTGCAGGAGGTTCTGCAGTGCAGAGGTGGCCTCCATCGTGGGGGCTGCCTGCCATG 
GATTCCTGGCCCCCTGAGGATCCTTGGCAGATGATGGCTGCTGCGGCTGAGGACCGCCTGGGGGA 
AGCGCTGCCTGAAGAACTCTCTTACCTCTCCAGTGCTGCGGCCCTCGCTCCGGGCAGTGGCCCTT 
TGCCTGGGGAGTCTTCTCCCGATGCCACAGGCCTCTCACCTGAGGCTTCACTCCTCCACCAGGAC 
TCGGAGTCCAGACGACTGCCCCGTTCTAATTCACTGGGAGCCGGGGGAAAAATCCTTTCCCAACG 
CCCTCCCTGGTCTCTCATCCACAGGGTTCTGCCTGATCACCCCTGGGGTACCCTGAATCCCAGTG 
TGTCCTGGGGAGGTGGAGGCCCTGGGACTGGTTGGGGAACGAGGCCCATGCCACACCCTGAGGGA 
ATCTGGGGTATCAATAATCAACCCCCAGGTACCAGCTGGGGAAATATTAATCGGTATCCAGGAGG
CAGCTGGGGAAATATTAATCGGTATCCAGGAGGCAGCTGGGGGAATATTAATCGGTATCCAGGAG 
GCAGCTGGGGGAATATTCATCTATACCCAGGTATCAATAACCCATTTCCTCCTGGAGTTCTCCGC 
CCTCCTGGCTCTTCTTGGAACATCCCAGCTGGCTTCCCTAATCCTCCAAGCCCTAGGTTGCAGTG 
GGGCTAGAGCACGATAGAGGGAAACCCAACATTGGGAGTTAGAGTCCTGCTCCCGCCCCTTGCTG 
TGTGGGCTCAATCCAGGCCCTGTTAACATGTTTCCAGCACTATCCCCACTTTTCAGTGCCTCCCC 
TGCTCATCTCCAATAAAATAAAAGCACTTATGAAAAAAA 
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FIGURE 64 



MQGRVAGSCAPLGLLLVCLHLPGLFARSIGVVEEKVSQNFGTNLPQLGQPSSTGPSNSEHPQPAL
DPRSNDLARVPLKLSVPPSDGFPPAGGSAVQRWPPSWGLPAMDSWPPEDPWQMMAAAAEDRLGEA 
LPEELSYLSSAAALAPGSGPLPGESSPDATGLSPEASLLHQDSESRRLPRSNSLGAGGKILSQRP 
PWSLIHRVLPDHPWGTLNPSVSWGGGGPGTGWGTRPMPHPEGIWGINNQPPGTSWGNINRYPGGS 
WGNINRYPGGSWGNINRYPGGSWGNIHLYPGINNPFPPGVLRPPGSSWNIPAGFPNPPSPRLQWG 

Important features of the protein: 
Signal peptide: 

amino acids 1-26

Casein kinase II phosphorylation sites. 

amino acids 56-59, 155-158 

N-myristoylation sites. 

amino acids 48-53, 220-225, 221-226, 224-229, 247-252, 258-263, 
259-264, 269-274, 270-275, 280-285, 281-286, 305-310 
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FIGURE 65 

AAGGAGAGGCCACCGGGACTTCAGTGTCTCCTCCATCCCAGGAGCGCAGTGGCCACTATGGGGTC
TGGGCTGCCCCTTGTCCTCCTCTTGACCCTCCTTGGCAGCTCACATGGAACAGGGCCGGGTATGA 
CTTTGCAACTGAAGCTGAAGGAGTCTTTTCTGACAAATTCCTCCTATGAGTCCAGCTTCCTGGAA 
TTGCTTGAAAAGCTCTGCCTCCTCCTCCATCTCCCTTCAGGGACCAGCGTCACCCTCCACCATGC 
AAGATCTCAACACCATGTTGTCTGCAACACATGACAGCCATTGAAGCCTGTGTCCTTCTTGGCCC 
GGGCTTTTGGGCCGGGGATGCAGGAGGCAGGCCCCGACCCTGTCTTTCAGCAGGCCCCCACCCTC 
CTGAGTGGCAATAAATAAAATTCGGTATGCTG 
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FIGURE 66 

MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL 
HHARSQHHVVCNT



Important features: 
Signal peptide: 

amino acids 1-19 



N-glycosylation site.

amino acids 37-41

N-myristoylation sites.

amino acids 15-21, 19-25, 60-66
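
The feature lists under each protein figure name short consensus motifs such as N-glycosylation and N-myristoylation sites. The source does not state which tool produced these annotations. As a hedged illustration only, assuming the PROSITE-style consensus patterns N-{P}-[ST]-{P} and G-{EDRKHPFYW}-x(2)-[STAGCN]-{P}, the Python sketch below reports the 1-based start position of every match in a protein sequence. The names `MOTIFS`, `scan_motifs` and `FIG66_LINE1` are hypothetical.

```python
import re

# PROSITE-style consensus patterns written as regular expressions (assumed,
# not taken from the source). The lookahead lets overlapping matches be found.
MOTIFS = {
    "N-glycosylation": r"(?=N[^P][ST][^P])",
    "N-myristoylation": r"(?=G[^EDRKHPFYW]..[STAGCN][^P])",
}


def scan_motifs(protein):
    """Return {motif name: [1-based start positions]} for one sequence."""
    seq = "".join(protein.split()).upper()  # drop spacing left over from extraction
    return {
        name: [m.start() + 1 for m in re.finditer(pattern, seq)]
        for name, pattern in MOTIFS.items()
    }


# First 65 residues of the Figure 66 sequence.
FIG66_LINE1 = "MGSGLPLVLLLTLLGSSHGTGPGMTLQLKLKESFLTNSSYESSFLELLEKLCLLLHLPSGTSVTL"

print(scan_motifs(FIG66_LINE1))
```

For these residues the sketch reports an N-glycosylation match starting at position 37 and N-myristoylation matches starting at positions 15, 19 and 60, which agree with the start positions listed for Figure 66. The figure gives each site as a range, whereas the sketch returns start positions only.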
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FIGURE 67 



ACGGACCGAGGGTTCGAGGGAGGGACACGGACCAGGAACCTGAGCTAGGTCAAAGACGCCCGGGC 
CAGGTGCCCCGTCGCAGGTGCCCCTGGCCGGAGATGCGGTAGGAGGGGCGAGCGCGAGAAGCCCC 
TTCCTCGGCGCTGCCAACCCGCCACCCAGCCCATCGCGAACCCCGGGCTGGGGCTGCTTCTGGCG 
CTGGGCCTGCCGTTCCTGCTGGCCCGCTGGGGCCGAGCCTGGGGGCAAATACAGACCACTTCTGC 
AAATGAGAATAGCACTGTTTTGCCTTCATCCAGCAGCTCCAGCTCCGATGGCAACCTGCGTCCGG 
AAGCCATCACTGCTATCATCGTGGTCTTGTCCCTCTTGGCTGCCTTGCTCCTGGCTGTGGGGCTG 
GCACTGTTGGTGCGGAAGCTTCGGGAGAAGCGGCAGACGGAGGGCACCTACCGGCCCAGTAGCGA
GGAGCAGTTCTCCCATGCAGCCGAGGCCCGGGCCCCTCAGGACTCCAAGGAGACGGTGCAGGGCT
GCCTGCCCATCTAGGTCCCCTCTCCTGCATCTGTCTCCCTTCATTGCTGTGTGACCTTGGGGAAA
GGCAGTGCCCTCTCTGGGCAGTCAGATCCACCCAGTGCTTAATAGCAGGGAAGAAGGTACTTCAA
AGACTCTGGCCCTGAGGTCAAGAGAGGATGGGGCTATTCACTTTTATATATTTATATAAAATTAG
TAGTGAGATGTAAAAAAAAAAAAAAAAAA
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FIGURE 68 

MANPGLGLLIALGLPFLLARWGRAWGQIQTTSANENSTVLPSSTSSSSDGNLRPEAITAIIVVFS
LLAALLIAVGLALLVRKLREKRQTEGTYRPSSEEQFSHAAEARAPQDSKETVQGCLPI

Important features: 
Signal peptide: 

amino acids 1-19 

Transmembrane domain: 

amino acids 56-80 

N-glycosylation site. 

amino acids 36-40 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 86-90 

Tyrosine kinase phosphorylation site. 

amino acids 86-94 

N-myristoylation sites. 

amino acids 7-13, 26-32

FIGURE 69

GCCAGGAATAACTAGAGAGGAACAATGGGGTTATTCAGAGGTTTTGTTTTCCTCTTAGTTCTGTGCCTGCTGCACCAG 
TCAAATACTT CCTTCATTAAGCT GAATAATAATGGCTTTGAAGATATTGT CATTGTTATAGATCCTAGTGTGCCAGAA 
GAT GAAAAAATAATTGAACAAAT AGAGGATAT GGT GACT AC AGCT T CTAC GTACCT GTTT GAAGCCAC AGAAAAAAGA 
TTTTTTTTCAAAAATGTATCTATATTAATTCCTGAGAATTGGAAGGAAAATCCTCAGTACAAAAGGCCAAAAGATGAA
AACCAT AAACAT GCT GATGT T ATAGTT GCAC CACCTACACT CCCAGGTAGAGATGAACCAT ACACCAAGCAGTT CACA 
GAATGT G GAGAGAAAGG C GAATACAT T CACT T CAC C CCT GACC TT CTACT T GGAAAAAAAC AAAATGAATAT GGACCA 
CCAGGCAAACTGTTTGTCCATGAGTGGGCTCACCTCCGGTGGGGAGTGTTTGATGAGTACAATGAAGATCAGCCTTTC 
TACCGTGCTAAGTCAAAAAAAATCGAAGCAACAAGGTGTTCCGCAGGTATCTCTGGTAGAAATAGAGTTTATAAGTGT 
CAAGGAGGCAGCT GT CT TAGT AGAGC AT GCAGAAT T GATT CT ACAACAAAACT GTAT GGAAAAGATT GTCAAT T CT TT 
CCTGATAAAGTACAAACAGAAAAAGCATCCATAAT GTTTATGCAAAGTATTGATTCTGTT GTTGAATTTTGTAACGAA 
AAAACCCATAATCAAGAAGCT CCAAGCCTACAAAACAT AAAGTGCAATTTTAGAAGTACAT GGGAGGT GATTAGCAAT 
TCTGAGGATTTTAAAAACACCATACCCATGGTGACACCACCTCCTCCACCTGTCTTCTCATTGCTGAAGATCAGTCAA
AGAATTGTGTGCTTAGTTCTTGATAAGTCTGGAAGCATGGGGGGTAAGGACCGCCTAAATCGAATGAATCAAGCAGCA
AAACATTTCCTGCTGCAGACTGTTGAAAATGGATCCTGGGTGGGGATGGTTCACTTTGATAGTACTGCCACTATTGTA
AATAAGCTAATCCAAATAAAAAGCAGTGATGAAAGAAACACACTCATGGCAGGATTACCTACATATCCTCTGGGAGGA
ACTTCCATCTGCTCTGGAATTAAATATGCATTTCAGGTGATTGGAGAGCTACATTCCCAACTCGATGGATCCGAAGTA
CTGCTGCTGACTGATGGGGAGGATAACACTGCAAGTTCTTGTATTGATGAAGTGAAACAAAGTGGGGCCATTGTTCAT
TTTATTGCTTTGGGAAGAGCTGCTGATGAAGCAGTAATAGAGATGAGCAAGATAACAGGAGGAAGTCATTTTTATGTT
TCAGATGAAGCTCAGAACAATGGCCTCATTGATGCTTTTGGGGCTCTTACATCAGGAAATACTGATCTCTCCCAGAAG
TCCCTTCAGCTCGAAAGTAAGGGATTAACACTGAATAGTAATGCCTGGATGAACGACACTGTCATAATTGATAGTACA
GTGGGAAAGGACACGTTCTTTCTCATCACATGGAACAGTCTGCCTCCCAGTATTTCTCTCTGGGATCCCAGTGGAACA
ATAATGGAAAATTTCACAGTGGATGCAACTTCCAAAATGGCCTATCTCAGTATTCCAGGAACTGCAAAGGTGGGCACT
TGGGCATACAATCTTCAAGCCAAAGCGAACCCAGAAACATTAACTATTACAGTAACTTCTCGAGCAGCAAATTCTTCT
GTGCCTCCAATCACAGTGAATGCTAAAATGAATAAGGACGTAAACAGTTTCCCCAGCCCAATGATTGTTTACGCAGAA
ATTCTACAAGGATATGTACCTGTTCTTGGAGCCAATGTGACTGCTTTCATTGAATCACAGAATGGACATACAGAAGTT
TTGGAACTTTTGGATAATGGTGCAGGCGCTGATTCTTTCAAGAATGATGGAGTCTACTCCAGGTATTTTACAGCATAT
ACAGAAAATGGCAGATATAGCTTAAAAGTTCGGGCTCATGGAGGAGCAAACACTGCCAGGCTAAAATTACGGCCTCCA
CTGAATAGAGCCGCGTACATACCAGGCTGGGTAGTGAACGGGGAAATTGAAGCAAACCCGCCAAGACCTGAAATTGAT
GAGGATACTCAGACCACCTTGGAGGATTTCAGCCGAACAGCATCCGGAGGTGCATTTGTGGTATCACAAGTCCCAAGC
CTTCCCTTGCCTGACCAATACCCACCAAGTCAAATCACAGACCTTGATGCCACAGTTCATGAGGATAAGATTATTCTT
ACATGGACAGCACCAGGAGATAATTTTGATGTTGGAAAAGTTCAACGTTATATCATAAGAATAAGTGCAAGTATTCTT
GATCTAAGAGACAGTTTTGATGATGCTCTTCAAGTAAATACTACTGATCTGTCACCAAAGGAGGCCAACTCCAAGGAA

AGCTTT GCATTTAAAC CAGAAAAT ATCT CAGAAGAAAAT GCAACCCACATATTTATTGCCATTAAAAGTATAGATAAA 
AGCAATT TGACAT CAAAAGTAT CCAACAT TGCACAAGT AACT T T GT TTAT CC CTCAAGCAAATCCT GATGACAT T GAT 
C CTACACCTACTC CTACT CCTACT CCT ACTC CT GATAAAAGT CAT AAT T CTG GAGT T AATAT TT CTACGCT GGTATTG 
TCT GT GATT GGGT CTGT TGT AAT T GTT AACTT TATT TT AAGT AC C ACCATT TGAAC CT TAACGAAGAAAAAAAT CTT C 
AAGTAGAC C TAGAAGAGAGTT TT AAAAAACAAAACAAT GTAAGT AAAGGATATTT C T GAAT CTT AAAAT TCATC C CAT 
GT GT GATCAT AAACTC ATAAAAAT AATT T TAAGAT GT CGGAAAAG GAT ACT TT GAT T AAAT AAAAAC ACT CATGGATA 
TGTAAAAACT GTCAAGAT TAAAAT TTAATAGTTT C ATTT ATT T GTT ATT T TATTT GT AAGAAAT AGT GAT GAAC AAAG 
ATCCTTTTT CATACT GATACCT GGTT GT ATATTATT T GAT GCAACAGTTT T CT GAAAT GATATTT CAAATT GCAT CAA 
GAAATTAAAATCATCTATCT GAGT AGT CAAAATACAAGT AAAGGAGAGCAAATAAACAACATTT GGAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA^AAAAAAAAAAAAAAA 
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FIGURE 70 

MGLFRGFVFLLVLCLLHQSNTSFIKLNNNGFEDIVIVIDPSVPEDEKIIEQIEDMVTTASTYLFE 
ATEKRFFFKNVSILIPENWKENPQYKRPKHENHKHADVIVAPPTLPGRDEPYTKQFTECGEKGEY 
IHFTPDLLLGKKQNEYGPPGKLFVHEWAHLRWGVFDEYNEDQPFYRAKSKKIEATRCSAGISGRN 
RVYKCQGGSCLSRACRIDSTTKLYGKDCQFFPDKVQTEKASIMFMQSIDSVVEFCNEKTHNQEAP 
SLQNIKCNFRSTWEVISNSEDFKNTIPMVTPPPPPVFSLLKISQRIVCLVLDKSGSMGGKDRLNR 
MNQAAKHFLLQTVENGSWVGMVHFDSTATIVNKLIQIKSSDERNTLMAGLPTYPLGGTSICSGIK
YAFQVIGELHSQLDGSEVLLLTDGEDNTASSCIDEVKQSGAIVHFIALGRAADEAVIEMSKITGG
SHFYVSDEAQNNGLIDAFGALTSGNTDLSQKSLQLESKGLTLNSNAWMNDTVIIDSTVGKDTFFL
ITWNSLPPSISLWDPSGTIMENFTVDATSKMAYLSIPGTAKVGTWAYNLQAKANPETLTITVTSR
AANSSVPPITVNAKMNKDVNSFPSPMIVYAEILQGYVPVLGANVTAFIESQNGHTEVLELLDNGA
GADSFKNDGVYSRYFTAYTENGRYSLKVRAHGGANTARLKLRPPLNRAAYIPGWVVNGEIEANPP
RPEIDEDTQTTLEDFSRTASGGAFVVSQVPSLPLPDQYPPSQITDLDATVHEDKIILTWTAPGDN
FDVGKVQRYIIRISASILDLRDSFDDALQVNTTDLSPKEANSKESFAFKPENISEENATHIFIAI
KSIDKSNLTSKVSNIAQVTLFIPQANPDDIDPTPTPTPTPTPDKSHNSGVNISTLVLSVIGSVVI
VNFILSTTI

Signal peptide: 

amino acids 1-21 

Putative transmembrane domains: 

amino acids 284-300, 617-633 

Leucine zipper pattern. 

amino acids 469-491, 476-498 

N-glycosylation site. 

amino acids 20-24, 75-79, 340-344, 504-508, 542-546, 588-592, 
628-632, 811-815, 832-836, 837-841, 852-856, 896-900 
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FIGURE 71 



C T CCT T AGGT G GAAAC C CTGGGAGTAGAGTACT GACAGCAAAGAC C GGGAAAGACCATAC GT C CCC GGGCAGGG GTGA 
CAACAGGTGTCATCTTTTTGATCTCGTGTGTGGCTGCCTTCCTATTTCAAGGAAAGACGCCAAGGTAATTTTGACCCA 
GAGGAGGAATGATGTAGCCACCTCCTAACCTTCCCTTCTTGAACCCCCAGTTATGCCAGGATTTACTAGAGAGTGTCA 
ACTCAACCAGCAAGCGGCTCCTTCGGCTTAACTTGTGGTTGGAGGAGAGAACCTTTGTGGGGCTGCGTTCTCTTAGCA 
GT GCT CAGAAGTGACTT GCCT GAGGGTGGACCAGAAGAAAGGAAAGGTCCCCTCTTGCTGTTGGCTGCACATCAGGAA 
GGCTGTGATGGGAATGAAGGTGAAAACTTGGAGATTTCACTTCAGTCATTGCTTCTGCCTGCAAGATCATCCTTTAAA 
AGT AGAGAAGCTGCTCT GTGT GGT GGTT AAC T CCAAGAGGCAGAACTCGT T CT AGAAGGAAATGGAT G CAAGCAGCTC 
CGGGGGCCCCAAACGCATGCTTCCTGTGGTCTAGCCCAGGGAAGCCCTTCCGTGGGGGCCCCGGCTTTGAGGGATGCC 
ACCGGTTCTGGACGCATGGCTGATTCCTGAATGATGATGGTTCGCCGGGGGCTGCTTGCGTGGATTTCCCGGGTGGTG 
GT T TT GCT GGT GCTC CT CTGCTGT GCT AT CT CT GT CCT GTACAT GT TGGC CTGCAC CCCAAAAGGTGACGAGGAGCAG 
CTGGCACTGCCCAGGGCCAACAGCCCCACGGGGAAGGAGGGGTACCAGGCCGTCCTTCAGGAGTGGGAGGAGCAGCAC 
C GCAACTACGT GAGCAGCCT GAAGCGGCAGATCGCACAGCTCAAGGAGGAG CT GCAGGAGAGGAGTGAGCAGCT CAGG 
AATGGGCAGTACCAAGCCAGCGATGCTGCTGGCCTGGGTCTGGACAGGAGCCCCCCAGAGAAAACCCAGGCCGACCTC
CTGGCCTTCCTGCACTCGCAGGTGGACAAGGCAGAGGTGAATGCTGGCGTCAAGCTGGCCACAGAGTATGCAGCAGTG
CCTTT C GAT AGCTT TACT CTACAGAAGGTGTACCAGCTGGAGACT GGCCTTACCCGCCACCCCGAGGAGAAGCCT GTG
AGGAAGGACAAGCGGGATGAGTT GGTGGAAGCCATTGAATCAGCCTTGGAGACCCT GAACAATCCTGCAGAGAACAGC
C CCAAT CAC CGTCCTT ACAC GGCCT CTGATT T CAT AGAAG GGATCT AC C GAACAGAAAGGGACAAAGGGACAT T GTAT
GAGCT CAC CTTCAAAGGGGACCACAAACAC GAAT T CAAACGGCT CATCT T ATT TCGACCAT T CAGCC CCAT CAT GAAA
GT GAAAAAT GAAAAG C T CAACAT GGC C AAC ACGCT TAT CAAT GTT ATC GT GCCT CTAGCAAAAAGGGT GGACAAGT TC
C GGC AGTTCAT GCAGAAT TT CAGGGAGAT GT GC ATT GAGCAGGAT GG GAGAGT CCAT CT C ACT GT TGT T TACT T T GGG
AAAGAAGAAATAAAT GAAGT CAAAGGAATACTTGAAAACACTTCCAAAGCT GCCAACT T CAG GAACTT TAC CTT CATC
CAGCT GAAT GGAGAAT TT T CT CGGGGAAAGGGACTTGATGTTGGAGCCCGCTT CTGGAAGG GAAGCAACGT CCTT CTC
TTTTTCTGT GATGTGGACAT CTACTTCACATCTGAATTCCTCAAT ACGT GTAGGCT GAATACACAGCCAGGGAAGAAG
GTATTT TAT CCAGT T CTT TT CAGT CAGTACAAT C CTG GCATAATATACGGCCAC CAT GATGCAGTCC CTCCCTTGGAA
cagcagctggtcataaagaaggaaactggattttggagagactttggatttgggatgacgtgtcagtatcggtcagac
ttcatcaatataggtgggtttgatctggacatcaaaggctggggcggagaggatgtgcacctttatcgcaagtatctc
cacagcaacctcatagtggtacggacgcctgtgcgaggactcttccacctctggcatgagaagcgctgcatggacgag
ctgacccccgagcagtacaagatgtgcatgcagtccaaggccatgaacgaggcatcccacggccagctgggcatgctg
gtgttcaggcacgagatagaggctcaccttcgcaaacagaaacagaagacaagtagcaaaaaaacatgaactcccaga
gaaggattgtgggagacacttttt cttt ccttttgcaattactgaaagt ggct gcaacagagaaaagactt ccataaa
ggacgacaaaagaatt ggact gat gggt cagagatgagaaagcctccgatttctctctgtt gggctttttacaacaga
aatcaaaatctccgctttgcctgcaaaagtaacccagttgcaccctgtgaagtgtctgacaaaggcagaatgcttgtg
agattataagcctaatggtgtggaggttttgatggtgtttacaatacactgagacctgttgttttgtgtgctcattga

aatattc^tgatttaagagcagttttgtaaaaaattcattagcatgaaaggcaagcatatttctcctcatatgaatga 
gcctatcagcagggctctagtttctaggaatgctaaaatatcagaaggcaggagaggagataggcttattatgatact 
agtgagtactvttaagtaaaataaaatggaccagaat^gaaaagaaaccataaatat cgtgt catattttccccaagat 
taacc^w^taatctgcttatctttttggttgtccttttaactgtctccgtttttttcttttatttaaaaatgcact 
tttttt cccttgt gagttatagt ctgcttatttaattaccactttgcaagccttacaagagagcacaagtt ggcctac 

ATTTTT ATATTTTTTAAGAAGATACTTT GAGATGCATT ATGAGAACTTTCAGTTCAAAGCATCAAATT GAT GCCATAT 
CCAAGGACATGCCAAAT GCTGATT CTGT CAGGCACTGAATGT CAGGCATTGAGACATAGGGAAGGAAT GGTTTGTACT 
AATACAGACGTACAGATACTTTCTCTGAAGAGTATTTTCGAAGAGGAGCAACTGAACACTGGAGGAAAAGAAAATGAC 
ACTTT CT GCTT TACAGAAAAGGAAACT CATT CAGACT GGTGATAT CGTGAT GT ACCT AAAAGTCAGAAAC CACAT T TT 
CT C CT CAGAAGTAGGGACC GCT T T CT T ACCT GTT TAAATAAAC CAAAGT ATAC CGT GTGAAC C AAACAAT CT CT T T T C 
AAAACAGGGTGCT CCT CCTGGCTT CTGGCTT CCATAAGAAGAAATGGAGAAAAATAT ATAT ATATATATAT ATATTGT 
GAAAGATCAAT C CATCT G CCAGAATCT AGTGGGAT GGAAGTT T TT GCTACAT GT TAT CCACCCCAGGCCAGGTGGAAG 
TAACT GAAT TATT T TT TAAATTAAGCAGTTCT ACTCAAT CAC CAAGATGCT T CT GAAAATT GCATT TT ATT ACCATTT 
CAAACT ATTTT TTAAAAATAAAT ACAGT TAAC AT AGAGT GGT TT CTT CATT CAT GT GAAAAT TAT TAGCCAGCACCAG 

at g cat gagctaat tat ct ct tt gagt cctt gctt ct gttt gct cacagtaaactcatt gtt taaaagctt caagaac 
attcaagctgttggtgtgttaaaaaatgcattgtattgatttgtactggtagtttatgaaatttaattaaaacacagg 
c cat gaat ggaaggtggtatt gcacagctaat aaaatat gatttgt ggatatgaa 
FIGURE 72



MMMVRRGLLAWISRVVVLLVLLCCAISVLYMLACTPKGDEEQLALPRANSPTGKEGYQAVLQEWE
EQHRNYVSSLKRQIAQLKEELQERSEQLRNGQYQASDAAGLGLDRSPPEKTQADLLAFLHSQVDK
AEVNAGVKLATEYAAVPFDSFTLQKVYQLETGLTRHPEEKPVRKDKRDELVEAIESALETLNNPA
ENSPNHRPYTASDFIEGIYRTERDKGTLYELTFKGDHKHEFKRLILFRPFSPIMKVKNEKLNMAN
TLINVIVPLAKRVDKFRQFMQNFREMCIEQDGRVHLTVVYFGKEEINEVKGILENTSKAANFRNF
TFIQLNGEFSRGKGLDVGARFWKGSNVLLFFCDVDIYFTSEFLNTCRLNTQPGKKVFYPVLFSQY
NPGIIYGHHDAVPPLEQQLVIKKETGFWRDFGFGMTCQYRSDFINIGGFDLDIKGWGGEDVHLYR
KYLHSNLIVVRTPVRGLFHLWHEKRCMDELTPEQYKMCMQSKAMNEASHGQLGMLVFRHEIEAHL
RKQKQKTSSKKT

Important features: 
Signal peptide: 

amino acids 1-27 

N-glycosylation sites. 

amino acids 315-319, 324-328

N-myristoylation sites. 

amino acids 96-102, 136-142, 212-218, 311-317, 339-345, 393-399 

Amidation site. 

amino acids 377-381 
FIGURE 73



G AGAC TGCAGAGGGAGATAAAGAGAGAGGGCAAAGAGGC AGCAAGAGAT T T GTC C T GGGGAT CCA 
GAAAC CCAT GAT AC CC T AC T GAAC AC CGAATCCCC T GGAAGCC CAC AGAGACAGAGAC AGCAAGA 
GAAGCAGAGATAAATACACTCACGCCAGGAGCTCGCTCGCTCTCTCTCTCTCTCTCTCACTCCTC 
CCTCCCTCTCTCTCTGCCTGTCCTAGTCCTCTAGTCCTCAAATTCCCAGTCCCCTGCACCCCTTC 
CTGGGACACTATGTTGTTCTCCGCCCTCCTGCTGGAGGTGATTTGGATCCTGGCTGCAGATGGGG 
GTCAACACTGGACGTATGAGGGCCCACATGGTCAGGACCATTGGCCAGCCTCTTACCCTGAGTGT 
GGAAACAATGCCCAGTCGCCCATCGATATTCAGACAGACAGTGTGACATTTGACCCTGATTTGCC 
TGCTCTGCAGCCCCACGGATATGACCAGCCTGGCACCGAGCCTTTGGACCTGCACAACAATGGCC 
ACACAGTGCAACTCTCTCTGCCCTCTACCCTGTATCTGGGTGGACTTCCCCGAAAATATGTAGCT 
GCCCAGC T CCACCT GCAC TGGGG T CAGAAAGGATCCCCAGGGGGGTCAGAACACCAGAT CAACAG 



TGAAGCCACATTTGCAGAGCTCCACATTGTACATTATGACTCTGATTCCTATGACAGCTTGAGTG
AGGCTGCTGAGAGGCCTCAGGGCCTGGCTGTCCTGGGCATCCTAATTGAGGTGGGTGAGACTAAG
AAT AT AGC T TATGAACACAT T CTGAGT CAC T T GCAT G AAG T CAG GC AT AAAGAT CAGAAG ACC TC
AGTGCCTCCCTTCAACCTAAGAGAGCTGCTCCCCAAACAGCTGGGGCAGTACTTCCGCTACAATG
GCTCGCTCACAACTCCCCCTTGCTACCAGAGTGTGCTCTGGACAGTTTTTTATAGAAGGTCCCAG
ATT TCAAT GGAACAGC T GGAAAAGCT TCAGGGGACAT TGT T CT CCACAGAAGAGGAGC CC TC TAA
GCTTCTGGTACAGAACTACCGAGCCCTTCAGCCTCTCAATCAGCGCATGGTCTTTGCTTCTTTCA

TCCAAGCAGGATCCTCGTATACCACAGGTGAAATGCTGAGTCTAGGTGTAGGAATCTTGGTTGGC 
TGTCTCTGCCTTCTCCTGGCTGTTTATTTCATTGCTAGAAAGATTCGGAAGAAGAGGCTGGAAAA 
CCGAAAGAGT GT GGTC T TCACCT CAGCACAAGC CACGAC TGAGGC ATAAA T TCCT TC T CAGAT AC 
CATGGATGTGGATGACTTCCCTTCATGCCTATCAGGAAGCCTCTAAAATGGGGTGTAGGATCTGG 
CCAGAAACACTGTAGGAGTAGTAAGCAGATGTCCTCCTTCCCCTGGACATCTCTTAGAGAGGAAT 
GGACC CAG GCTGTCAT TC CAGGAAGAACTGCAGAGCC T T CAGC C TCT CCAAACATGTAGGAGGAA
ATGAGGAAATCGCTGTGTTGTTAATGCAGAGANCAAACTCTGTTTAGTTGCAGGGGAAGTTTGGG 
ATATACCCCAAAGTCCTCTACCCCCTCACTTTTATGGCCCTTTCCCTAGATATACTGCGGGATCT 
C TCCT TAGGATAAAGAGT TGCT GTTGAAG TTGTATAT T TT TGATCAATATATTTGGAAAT TAAAG 
TTTCTGACTTT 
FIGURE 74



MLFSALLLEVIWILAADGGQHWTYEGPHGQDHWPASYPECGNNAQSPIDIQTDSVTFDPDLPALQ 
PHGYDQPGTEPLDLHNNGHTVQLSLPSTLYLGGLPRKYVAAQLHLHWGQKGSPGGSEHQINSEAT 
FAELHIVHYDSDSYDSLSEAAERPQGLAVLGILIEVGETKNIAYEHILSHLHEVRHKDQKTSVPP 
FNLRELLPKQLGQYFRYNGSLTTPPCYQSVLWTVFYRRSQISMEQLEKLQGTLFSTEEEPSKLLV
QNYRALQPLNQRMVFASFIQAGSSYTTGEMLSLGVGILVGCLCLLLAVYFIARKIRKKRLENRKS
VVFTSAQATTEA

Important features of the protein: 
Signal peptide: 

amino acids 1-15 

Transmembrane domain:

amino acids 291-310 

N-glycosylation site. 

amino acids 213-216 

Eukaryotic-type carbonic anhydrases proteins

amino acids 197-245, 104-140, 22-69
FIGURE 75



TGCCGCTGCCGCCGCTGCTGCTGTTGCTCCTGGCGGCGCCTTGGGGACGGGCAGTTCCCTGTGTC 
TCTGGTGGTTTGCCTAAACCTGCAAACATCACCTTCTTATCCATCAACATGAAGAATGTCCTACA 
ATGGACT CCACCAGAGGGTCT T CAAGGAG T T AAAG T T ACT T ACACTGTGCAGTAT T TCAT CACAA 
ATTGGCCCACCAGAGGTGGCACTGACTACAGATGAGAAGTCCATTTCTGTTGTCCTGACAGCTCC 
AGAGAAGTGGAAGAGAAATCCAGAAGACCTTCCTGTTTCCATGCAACAAATATACTCCAATCTGA
AGTATAACGTGTCTGTGTTGAATACTAAATCAAACAGAACGTGGTCCCAGTGTGTGACCAACCAC 
ACGCTGGTGCTCACCTGGCTGGAGCCGAACACTCTTTACTGCGTACACGTGGAGTCCTTCGTCCC 
AGGGCCCCCTCGCCGTGCTCAGCCTTCTGAGAAGCAGTGTGCCAGGACTTTGAAAGATCAATCAT 
C AGAGT T CAAGGCTAAAAT CAT C T TC TGG TAT GT T T T GC CCAT ATC TAT TACCGT GT TTCTTTTT
TCT G T GATGGGC TATTCCATCT ACCGATATATCCACGTT GGCAAAGAGAAACACCCAGCAAAT TT
GATTTTGATTTATGGAAATGAATTTGACAAAAGATTCTTTGTGCCTGCTGAAAAAATCGTGATTA
ACT T TAT CACCC TCAATATC T C GGAT GAT TCTAAAAT TT C T CATCAGGAT AT GAG T T T AC T GGGA
AAAAGCAGTGATGTATCCAGCCTTAATGATCCTCAGCCCAGCGGGAACCTGAGGCCCCCTCAGGA
GGAAGAGGAGGTGAAACATTTAGGGTATGCTTCGCATTTGATGGAAATTTTTTGTGACTCTGAAG
AAAACAC GGAAGGTAC T TC TC T CACC CAGCAAGAGT C CCT CAGCAGAACAATACCC CC GGAT AAA
ACAGT CATT GAATATGAATATGATGT CAGAACCACTGACATTTGTGCGGGGCCTGAAGAGCAGGA
GC T CAGT T T GCAGGAGGAGGT GT CCACACAAGGAACAT T ATTGGAGTCGCAGGCAGCGT T G GCAG
TCTTGGGCCCGCAAACGTTACAGTACTCATACACCCCTCAGCTCCAAGACTTAGACCCCCTGGCG

CAGGAGCACACAGACTCGGAGGAGGGGCCGGAGGAAGAGCCATCGACGACCCTGGTCGACTGGGA 
TCCCCAAACTGGCAGGCTGTGTATTCCTTCGCTGTCCAGCTTCGACCAGGATTCAGAGGGCTGCG 
AGCCTTCTGAGGGGGATGGGCTCGGAGAGGAGGGTCTTCTATCTAGACTCTATGAGGAGCCGGCT 
CCAGACAGGCCACCAGGAGAAAATGAAACCTAT C TCATGCAAT TCAT GGAGGAAT GGGGGT TATA 
TGTGCAGATGGAAAACTGATGCCAACACTTCCTTTTGCCTTTTGTTTCCTGTGCAAACAAGTGAG 
TCACCCCTTTGATCCCAGCCATAAAGTACCTGGGATGAAAGAAGTTTTTTCCAGTTTGTCAGTGT 
CTGTGAGAATTACTTATTTCTTTTCTCTATTCTCATAGCACGTGTGTGATTGGTTCATGCATGTA 
GGTCTCTTAACAATGATGGTGGGCCTCTGGAGTCCAGGGGCTGGCCGGTTGTTCTATGCAGAGAA 
AGCAGTCAATAAATGTTTGCCAGACTGGGTGCAGAATTTATTCAGGTGGGTGT 
FIGURE 76



MSYNGLHQRVFKELKLLTLCSISSQIGPPEVALTTDEKSISVVLTAPEKWKRNPEDLPVSMQQIY
SNLKYNVSVLNTKSNRTWSQCVTNHTLVLTWLEPNTLYCVHVESFVPGPPRRAQPSEKQCARTLK
DQSSEFKAKIIFWYVLPISITVFLFSVMGYSIYRYIHVGKEKHPANLILIYGNEFDKRFFVPAEK
IVINFITLNISDDSKISHQDMSLLGKSSDVSSLNDPQPSGNLRPPQEEEEVKHLGYASHLMEIFC 
DSEENTEGTSLTQQESLSRTIPPDKTVIEYEYDVRTTDICAGPEEQELSLQEEVSTQGTLLESQA 
ALAVLGPQTLQYSYTPQLQDLDPLAQEHTDSEEGPEEEPSTTLVDWDPQTGRLCIPSLSSFDQDS
EGCEPSEGDGLGEEGLLSRLYEEPAPDRPPGENETYLMQFMEEWGLYVQMEN 

Important features: 
Signal peptide: 

amino acids 1-28 

Transmembrane domain: 

amino acids 140-163 

N-glycosylation sites. 

amino acids 71-74, 80-83, 89-92, 204-207, 423-426 
FIGURE 77

GAGGAGCGGGCCGAGGACTCCAGCGTGCCCAGGTCTGGCATCCTGCACTTGCTGCCCTCTGACAC 
CTGGGAAGATGGCCGGCCCGTGGACCTTCACCCTTCTCTGTGGTTTGCTGGCAGCCACCTTGATC 
CAAGC CACC C T CAG T CCC AC TGCAGT TC TCATC C TCGGCCCAAAAG TCAT CAAAGAAAAGCT GAC 
ACAGGAGCTGAAGGACCACAACGCCACCAGCATCCTGCAGCAGCTGCCGCTGCTCAGTGCCATGC 
GGGAAAAGCCAGCCGGAGGCATCCCTGTGCTGGGCAGCCTGGTGAACACCGTCCTGAAGCACATC 
ATCTGGCTGAAGGTCATCACAGCTAACATCCTCCAGCTGCAGGTGAAGCCCTCGGCCAATGACCA 
GGAGCTGCTAGTCAAGATCCCCCTGGACATGGTGGCTGGATTCAACACGCCCCTGGTCAAGACCA 
TCGTGGAGTTCCACATGACGACTGAGGCCCAAGCCACCATCCGGATGGACACCAGTGCAAGTGGC 
CCCACCCGCCTGGTCCTCAGTGACTGTGCCACCAGCCATGGGAGCCTGCGCATCCAACTGCTGTA 
TAAGCTCTCCTTCCTGGTGAACGCCTTAGCTAAGCAGGTCATGAACCTCCTAGTGCCATCCCTGC 
C CAAT CT AGTGAAAAACC AGC T G T G T CCC GT GAT C GAGGCT TC C T T CAAT GG CAT GT AT GC AGAC 
CTCCTGCAGCTGGTGAAGGTGCCCATTTCCCTCAGCATTGACCGTCTGGAGTTTGACCTTCTGTA 
TCCTGCCATCAAGGGTGACACCATTCAGCTCTACCTGGGGGCCAAGTTGTTGGACTCACAGGGAA 
AGGTGACCAAGTGGTTCAATAACTCTGCAGCTTCCCTGACAATGCCCACCCTGGACAACATCCCG 
TTCAGCCTCATCGTGAGTCAGGACGTGGTGAAAGCTGCAGTGGCTGCTGTGCTCTCTCCAGAAGA 
ATTCATGGTCCTGTTGGACTCTGTGCTTCCTGAGAGTGCCCATCGGCTGAAGTCAAGCATCGGGC 
TGATCAATGAAAAGGCTGCAGATAAGCTGGGATCTACCCAGATCGTGAAGATCCTAACTCAGGAC 
ACTCCCGAGTTTTTTATAGACCAAGGCCATGCCAAGGTGGCCCAACTGATCGTGCTGGAAGTGTT 
TCCCTCCAGTGAAGCCCTCCGCCCTTTGTTCACCCTGGGCATCGAAGCCAGCTCGGAAGCTCAGT 
TTTACACCAAAGGTGACCAACTTATACTCAACTTGAATAACATCAGCTCTGATCGGATCCAGCTG 
ATGAACTCTGGGATTGGCTGGTTCCAACCTGATGTTCTGAAAAACATCATCACTGAGATCATCCA 
CTCCATCCTGCTGCCGAACCAGAATGGCAAATTAAGATCTGGGGTCCCAGTGTCATTGGTGAAGG 
CCTTGGGATTCGAGGCAGCTGAGTCCTCACTGACCAAGGATGCCCTTGTGCTTACTCCAGCCTCC 
TTGTGGAAACCCAGCTCTCCTGTCTCCCAGTGAAGACTTGGATGGCAGCCATCAGGGAAGGCTGG 
GTCCCAGCTGGGAGTATGGGTGTGAGCTCTATAGACCATCCCTCTCTGCAATCAATAAACACTTG 
CCTGTGAAAAA 



FIGURE 78 

MAGPWTFTLLCGLLAATLIQATLSPTAVLILGPKVIKEKLTQELKDHNATSILQQLPLLSAMREK
PAGGIPVLGSLVNTVLKHIIWLKVITANILQLQVKPSANDQELLVKIPLDMVAGFNTPLVKTIVE
FHMTTEAQATIRMDTSASGPTRLVLSDCATSHGSLRIQLLYKLSFLVNALAKQVMNLLVPSLPNL
VKNQLCPVIEASFNGMYADLLQLVKVPISLSIDRLEFDLLYPAIKGDTIQLYLGAKLLDSQGKVT
KWFNNSAASLTMPTLDNIPFSLIVSQDVVKAAVAAVLSPEEFMVLLDSVLPESAHRLKSSIGLIN
EKAADKLGSTQIVKILTQDTPEFFIDQGHAKVAQLIVLEVFPSSEALRPLFTLGIEASSEAQFYT
KGDQLILNLNNISSDRIQLMNSGIGWFQPDVLKNIITEIIHSILLPNQNGKLRSGVPVSLVKALG
FEAAESSLTKDALVLTPASLWKPSSPVSQ

Important features of the protein: 
Signal peptide: 

amino acids 1-21 

N-glycosylation sites. 

amino acids 48-51, 264-267, 401-404 

Glycosaminoglycan attachment site.

amino acids 412-415 

LBP / BPI / CETP family proteins.

amino acids 407-457 



FIGURE 79



GAGAGAAGT CAGCC T GGCAGAGAGAC TCT GAAATGAGGGAT TAGAGGTGT T C AAGGAGCAAGAGC 

TTCAGCCTGAAGACAAGGGAGCAGTCCCTGAAGACGCTTCTACTGAGAGGTCTGCCATGGCCTCT 

CTTGGCCTCCAACTTGTGGGCTACATCCTAGGCCTTCTGGGGCTTTTGGGCACACTGGTTGCCAT 

GCTGCTCCCCAGCTGGAAAACAAGTTCTTATGTCGGTGCCAGCATTGTGACAGCAGTTGGCTTCT 

CCAAGGGCCTCTGGATGGAATGTGCCACACACAGCACAGGCATCACCCAGTGTGACATCTATAGC 

ACCCTTCTGGGCCTGCCCGCTGACATCCAGGCTGCCCAGGCCATGATGGTGACATCCAGTGCAAT 

CTCCTCCCTGGCCTGCATTATCTCTGTGGTGGGCATGAGATGCACAGTCTTCTGCCAGGAATCCC 

GAGCCAAAGACAGAGTGGCGGTAGCAGGTGGAGTCTTTTTGATCCTTGGAGGCCTCCTGGGATTC 

ATTCCTGTTGCCTGGAATCTTCATGGGATCCTACGGGACTTCTACTCACCACTGGTGCCTGACAG 

CAT GAAAT T TGAGAT TGGAGAGGCT CT T TAC T T GGGCAT TAT TTCTTCCCTGTTCTCCCT GATAG 

CTGGAAT CATCCTCTGCTTTTCCTGCTCATCCCAGAGAAATCGCTCCAACTACTACGATGCCTAC 

CAAGCCCAACCTCTTGCCACAAGGAGCTCTCCAAGGCCTGGTCAACCTCCCAAAGTCAAGAGTGA 

GTTCAATTCCTACAGCCTGACAGGGTATGTGT^AGAACCAGGGGCCAGAGCTGGGGGGTGGCTG 

GGTCTGTGAAAAACAGTGGACAGCACCCCGAGGGCCACAGGTGAGGGACACTACCACTGGATCGT 

GTCAGAAGGTGCTGCTGAGGATAGACTGACTTTGGCCATTGGATTGAGCAAAGGCAGAAATGGGG 

GCTAGTGTAACAGCATGCAGGTTGAATTGCCAAGGATGCTCGCCATGCCAGCCTTTCTGTTTTCC 

TCACCTTGCTGCTCCCCTGCCCTAAGTCCCCAACCCTCAACTTGAAACCCCATTCCCTTAAGCCA 

GGACTCAGAGGATCCCTTTGCCCTCTGGTTTACCTGGGACTCCATCCCCAAACCCACTAATCACA 

T CCCAC T GACTGACCC TCTGTGAT CAAAGAC CCT C T CT CT GGCT GAGG T TGGCT C T TAGC TCAT T 
GCTGGGGATGGGAAGGAGAAGCAGTGGCTTTTGTGGGCATTGCTCTAACCTACTTCTCAAGCTTC 
CCTCCAAAGAAACTGATTGGCCCTGGAACCTCCATCCCACTCTTGTTATGACTCCACAGTGTCCA 
GACTAATT TG TGCATGAACT GAAATAAAACCATCCTACGGTATCCAGGGAACAGAAAGCAGGATG 
CAGGAT GGGAGGACAGGAAGGC AGCC T GGGACAT T T AAAAAAATA 



FIGURE 80 

MASLGLQLVGYILGLLGLLGTLVAMLLPSWKTSSYVGASIVTAVGFSKGLWMECATHSTGITQCD 
IYSTLLGLPADIQAAQAMMVTSSAISSLACIISVVGMRCTVFCQESRAKDRVAVAGGVFLILGGL
LGFIPVAWNLHGILRDFYSPLVPDSMKFEIGEALYLGIISSLFSLIAGIILCFSCSSQRNRSNYY 

DAYQAQPLATRSSPRPGQPPKVKSEFNSYSLTGYV 

Important features of the protein: 
Signal peptide: 

amino acids 1-24 

Transmembrane domains: 

amino acids 82-102, 117-140, 163-182 

N-glycosylation site. 

amino acids 190-193 

PMP-22 / EMP / MP20 family proteins. 

amino acids 46-59 
FIGURE 81



CCCACGCGTCCGCGCCTCTCCCTTCTGCTGGACCTTCCTTCGTCTCTCCATCTCTCCCTCCTTTC 
CCCGCGTTCTCTTTCCACCTTTCTCTTCTTCCCACCTTAGACCTCCCTTCCTGCCCTCCTTTCCT 
GCCCACCGCTGCTTCCTGGCCCTTCTCCGACCCCGCTCTAGCAGCAGACCTCCTGGGGTCTGTGG 
GTTGATCTGTGGCCCCTGTGCCTCCGTGTCCTTTTCGTCTCCCTTCCTCCCGACTCCGCTCCCGG 
ACCAGCGGCCTGACCCTGGGGAAAGGATGGTTCCCGAGGTGAGGGTCCTCTCCTCCTTGCTGGGA 
CTCGCGCTGCTCTGGTTCCCCCTGGACTCCCACGCTCGAGCCCGCCCAGACATGTTCTGCCTTTT 
CCATGGGAAGAGATACTCCCCCGGCGAGAGCTGGCACCCCTACTTGGAGCCACAAGGCCTGATGT 
ACTGCCTGCGCTGTACCTGCTCAGAGGGCGCCCATGTGAGTTGTTACCGCCTCCACTGTCCGCCT
GTCCACTGCCCCCAGCCTGTGACGGAGCCACAGCAATGCTGTCCCAAGTGTGTGGAACCTCACAC
TCCCTCTGGACTCCGGGCCCCACCAAAGTCCTGCCAGCACAACGGGACCATGTACCAACACGGAG
AGATCTTCAGTGCCCATGAGCTGTTCCCCTCCCGCCTGCCCAACCAGTGTGTCCTCTGCAGCTGC
ACAGAGGGCCAGATCTACTGCGGCCTCACAACCTGCCCCGAACCAGGCTGCCCAGCACCCCTCCC
ACTGCCAGACTCCTGCTGCCAAGCCTGCAAAGATGAGGCAAGTGAGCAATCGGATGAAGAGGACA
GTGTGCAGTCGCTCCATGGGGTGAGACATCCTCAGGATCCATGTTCCAGTGATGCTGGGAGAAAG
AGAGGCCCGGGCACCCCAGCCCCCACTGGCCTCAGCGCCCCTCTGAGCTTCATCCCTCGCCACTT
CAGACCCAAGGGAGCAGGCAGCACAACTGTCAAGAT CGT CCTGAAGGAGAAACATAAGAAAGCCT
GTGTGCATGGCGGGAAGACGTACTCCCACGGGGAGGTGTGGCACCCGGCCTTCCGTGCCTTCGGC
CCCTTGCCCTGCATCCTATGCACCTGTGAGGATGGCCGCCAGGACTGCCAGCGTGTGACCTGTCC

CACCGAGTACCCCTGCCGTCACCCCGAGAAAGTGGCTGGGAAGTGCTGCAAGATTTGCCCAGAGG 
ACAAAGCAGACCCTGGCCACAGTGAGATCAGTTCTACCAGGTGTCCCAAGGCACCGGGCCGGGTC 
CTCGTCCACACATCGGTATCCCCAAGCCCAGACAACCTGCGTCGCTTTGCCCTGGAACACGAGGC 
C T C GGACT T GGT GGAGAT CTAC CT C T GGAAGC T GG TAAAAGAT GAGGAAAC T GAGGCT CAGAGAG 
GTGAAGTACCTGGCCCAAGGCCACACAGCCAGAATCTTCCACTTGACTCAGATCAAGAAAGTCAG 
GAAGCAAGACTTCCAGAAAGAGGCACAGCACTTCCGACTGCTCGCTGGCCCCCACGAAGGTCACT 
GGAACGTCTTCCTAGCCCAGACCCTGGAGCTGAAGGTCACGGCCAGTCCAGACAAAGTGACCAAG 
ACATAACAAAGACCTAACAGT TGCAGATATGAGCTGTATAATT GT TGT TATTAT ATAT TAATAAA 
TAAGAAGTT GCAT TACCC TCAAAAAAAAAAAAAAAAAAAAAA 
FIGURE 82



MVPEVRVLSSLLGLALLWFPLDSHARARPDMFCLFHGKRYSPGESWHPYLEPQGLMYCLRCTCSE
GAHVSCYRLHCPPVHCPQPVTEPQQCCPKCVEPHTPSGLRAPPKSCQHNGTMYQHGEIFSAHELF
PSRLPNQCVLCSCTEGQIYCGLTTCPEPGCPAPLPLPDSCCQACKDEASEQSDEEDSVQSLHGVR
HPQDPCSSDAGRKRGPGTPAPTGLSAPLSFIPRHFRPKGAGSTTVKIVLKEKHKKACVHGGKTYS
HGEVWHPAFRAFGPLPCILCTCEDGRQDCQRVTCPTEYPCRHPEKVAGKCCKICPEDKADPGHSE
ISSTRCPKAPGRVLVHTSVSPSPDNLRRFALEHEASDLVEIYLWKLVKDEETEAQRGEVPGPRPH
SQNLPLDSDQESQEARLPERGTALPTARWPPRRSLERLPSPDPGAEGHGQSRQSDQDITKT

Signal peptide: 
amino acids 1-25 
FIGURE 83




GACAGCTGTGTCTCGATGGAGTAGACTCTCAGAACAGCGCAGTTTGCCCTCCGCTCACGCAGAGCCTCTCC 
GTGGCTTCCGCACCTTGAGCATTAGGCCAGTTCTCCTCTTCTCTCTAATCCATCCGTCACCTCTCCTGTCA 
TCCGTTTCCATGCCGTGAGGTCCATTCACAGAACACATCCATGGCTCTCATGCTCAGTTTGGTTCTGAGTC 
TCCTCAAGCTGGGATCAGGGCAGTGGCAGGTGTTTGGGCCAGACAAGCCTGTCCAGGCCTTGGTGGGGGAG 
GACGCAGCATTCTCCTGTTTCCTGTCTCCTAAGACCAATGCAGAGGCCATGGAAGTGCGGTTCTTCAGGGG 

CC AGT T CTCT AGCGTGGTCCACCT CT AC AGGGACGGGAAGG ACC AGCC AT T T AT GC AG ATGCCAC AGT ATC 
AAGGCAGGACAAAACTGGTGAAGGATTCTATTGCGGAGGGGCGCATCTCTCTGAGGCTGGAAAACATTACT 
GTGTTGGATGCTGGCCTCTATGGGTGCAGGATTAGTTCCCAGTCTTACTACCAGAAGGCCATCTGGGAGCT 
ACAGGTGTCAGCACTGGGCTCAGTTCCTCTCATTTCCATCACGGGATATGTTGATAGAGACATCCAGCTAC 
TCTGTCAGTCCTCGGGCTGGTTCCCCCGGCCCACAGCGAAGTGGAAAGGTCCACAAGGACAGGATTTGTCC 
ACAGACTCCAGGACAAACAGAGACATGCATGGCCTGTTTGATGTGGAGATCTCTCTGACCGTCCAAGAGAA
CGCCGGGAGCATATCCTGTTCCATGCGGCATGCTCATCTGAGCCGAGAGGTGGAATCCAGGGTACAGATAG
GAGATACCTTTTTCGAGCCTATATCGTGGCACCTGGCTACCAAAGTACTGGGAATACTCTGCTGTGGCCTA

TTTTTTGGCATTGTTGGACTGAAGATTTTCTTCTCCAAATTCCAGTGGAAAATCCAGGCGGAACTGGACTG 
GAGAAGAAAGCACGGACAGGCAGAATTGAGAGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAG 
AGACGGCTCACCCGAAGCTCTGCGTTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTG 
CCTCACTCTGAGAAGAGATTTACAAGGAAGAGTGTGGTGGCTTCTCAGAGTTTCCAAGCAGGGAAACATTA 

CT GGGAGGTGGACGGAGG ACAC AAT AAAAGGT GGCGCGTGGG AGT GTGCCGGGAT G ATGT GG ACAGGAGGA 
AGGAGTACGTGACTTTGTCTCCCGATCATGGGTACTGGGTCCTCAGACTGAATGGAGAACATTTGTATTTC 
ACATTAAATCCCCGTTTTATCAGCGTCTTCCCCAGGACCCCACCTACAAAAATAGGGGTCTTCCTGGACTA 
TGAGTGTGGGACCATCTCCTTCTTCAACATAAATGACCAGTCCCTTATTTATACCCTGACATGTCGGTTTG 
AAGGCTTATTGAGGCCCTACATTGAGTATCCGTCCTATAATGAGCAAAATGGAACTCCCATAGTCATCTGC 
CCAGTCACCCAGGAATCAGAGAAAGAGGCCTCTTGGCAAAGGGCCTCTGCAATCCCAGAGACAAGCAACAG 
TGAGTCCTCCTCACAGGCAACCACGCCCTTCCTCCCCAGGGGTGAAATGTAGGATGAATCACATCCCACAT 
TCTTCTTTAGGGATATTAAGGTCTCTCTCCCAGATCCAAAGTCCCGCAGCAGCCGGCCAAGGTGGCTTCCA 
GATGAAGGGGGACTGGCCTGTCCACATGGGAGTCAGGTGTCATGGCTGCCCTGAGCTGGGAGGGAAGAAGG 

CT GAC ATT ACAT T T AGTTT GCT CT C ACT CC AT CT GGC T AAGT GAT CTT G AAATACC ACCT CTCAGGTG AAG 
AACCGTCAGGAATTCCCATCTCACAGGCTGTGGTGTAGATTAAGTAGACAAGGAATGTGAATAATGCTTAG 
ATCTTATTGATGACAGAGTGTATCCTAATGGTTTGTTCATTATATTACACTTTCAGTAAAAAAA 



FIGURE 84



MALMLSLVLSLLKLGSGQWQVFGPDKPVQALVGEDAAFSCFLSPKTNAEAMEVRFFRGQFSSVVH
LYRDGKDQPFMQMPQYQGRTKLVKDSIAEGRISLRLENITVLDAGLYGCRISSQSYYQKAIWELQ
VSALGSVPLISITGYVDRDIQLLCQSSGWFPRPTAKWKGPQGQDLSTDSRTNRDMHGLFDVEISL
TVQENAGSISCSMRHAHLSREVESRVQIGDTFFEPISWHLATKVLGILCCGLFFGIVGLKIFFSK
FQWKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF
TRKSVVASQSFQAGKHYWEVDGGHNKRWRVGVCRDDVDRRKEYVTLSPDHGYWVLRLNGEHLYFT
LNPRFISVFPRTPPTKIGVFLDYECGTISFFNINDQSLIYTLTCRFEGLLRPYIEYPSYNEQNGT
PIVICPVTQESEKEASWQRASAIPETSNSESSSQATTPFLPRGEM

Signal peptide:

amino acids 1-17

Transmembrane domain:

amino acids 239-255
FIGURE 85



AACAGACGl'TCCCTCGCGGCCCTGGCACCTCimCCCCAGACATGC'rGCTGCTGCXGCTGCCCCT 

GCT CT GGGGGAGGGAGAGGGCGGAAGGACAGACAAGTAAACTGCTGACGATGCAGAGT TCCGTGA 

CGGTGCAGGAAGGCCTGTGTGTCCATGTGCCCTGCTCCTTCTCCTACCCCTCGCATGGCTGGATT 

TACCCTGGCCCAGTAGTTCATGGCTACTGGTTCCGGGAAGGGGCCAATACAGACCAGGATGCTCC 

AGTGGCCACAAACAACCCAGCTCGGGCAGTGTGGGAGGAGACTCGGGACCGATTCCACCTCCTTG 

GGGACCCACATACCAAGAATTGCACCCTGAGCATCAGAGATGCCAGAAGAAGTGATGCGGGGAGA 

TACT T C T T T CGT AT GGAGAAAGGAAGTAT AAAAT GG AAT T ATAAACATCACCGGC T C T CT G T GAA 

TGTGACAGCCTTGACCCACAGGCCCAACATCCTCATCCCAGGCACCCTGGAGTCCGGCTGCCCCC 

AGAATCTGACCTGCTCTGTGCCCTGGGCCTGTGAGCAGGGGACACCCCCTATGATCTCCTGGATA 

GGGACCTCCGTGTCCCCCCTGGACCCCTCCACCACCCGCTCCTCGGTGCTCACCCTCATCCCACA 

GCCCCAGGACCATGGCACCAGCCTCACCTGTCAGGTGACCTTCCCTGGGGCCAGCGTGACCACGA 

ACAAGACCGTCCATCTCAACGTGTCCTACCCGCCTCAGAACTTGACCATGACTGTCTTCCAAGGA 

GACGGCACAGTATCCACAGTCTTGGGAAATGGCTCATCTCTGTCACTCCCAGAGGGCCAGTCTCT 

GCGCCTGGTCTGTGCAGTTGATGCAGTTGACAGCAATCCCCCTGCCAGGCTGAGCCTGAGCTGGA 

GAGGCCTGACCCTGTGCCCCTCACAGCCCTCAAACCCGGGGGTGCTGGAGCTGCCTTGGGTGCAC 

CTGAGGGATGCAGCTGAATTCACCTGCAGAGCTCAGAACCCTCTCGGCTCTCAGCAGGTCTACCT 

GAACGTCTCCCTGCAGAGCAAAGCCACATCAGGAGTGACTCAGGGGGTGGTCGGGGGAGCTGGAG 

CCACAGCCCTGGTCTTCCTGTCCTTCTGCGTCATCTTCGTTGTAGTGAGGTCCTGCAGGAAGAAA 

TCGGCAAGGCCAGCAGCGGGCGTGGGAGATACGGGCATAGAGGATGCAAACGCTGTCAGGGGTTC 

AGCCTCTCAGGGGCCCCTGACTGAACCTTGGGCAGAAGACAGTCCCCCAGACCAGCCTCCCCCAG 

CTTCTGCCCGCTCCTCAGTGGGGGAAGGAGAGCTCCAGTATGCATCCCTCAGCTTCCAGATGGTG 

AAGCCT TGGGACTCGCGGGGACAGGAGGCCACTGACACCGAGTACTCGGAGATCAAGATC CACAG 

ATGAGAAACTGCAGAGACTCACCCTGATTGAGGGATCACAGCCCCTCCAGGCAAGGGAGAAGTCA 

G AGG C T GAT T C T T G T AGAAT T AAC AG C CC T C AAC G T GAT GAGC TAT GAT AAC AC TAT G AAT T AT G 
TGCAGAGTGAAAAGCACACAGGCTTTAGAGTCAAAGTATCTCAAACCTGAATCCACACTGTGCCC 
TCCCTTTTATTTTTTTAACTAAAAGACAGACAAATTCCTA 



FIGURE 86 



MLLLLLPLLWGRERAEGQTSKLLTMQSSVTVQEGLCVHVPCSFSYPSHGWIYPGPVVHGYWFREG
ANTDQDAPVATNNPARAVWEETRDRFHLLGDPHTKNCTLSIRDARRSDAGRYFFRMEKGSIKWNY
KHHRLSVNVTALTHRPNILIPGTLESGCPQNLTCSVPWACEQGTPPMISWIGTSVSPLDPSTTRS 
SVLTLIPQPQDHGTSLTCQVTFPGASVTTNKTVHLNVSYPPQNLTMTVFQGDGTVSTVLGNGSSL 
SLPEGQSLRLVCAVDAVDSNPPARLSLSWRGLTLCPSQPSNPGVLELPWVHLRDAAEFTCRAQNP 
LGSQQVYLNVSLQSKATSGVTQGVVGGAGATALVFLSFCVIFVVVRSCRKKSARPAAGVGDTGIE
DANAVRGSASQGPLTEPWAEDSPPDQPPPASARSSVGEGELQYASLSFQMVKPWDSRGQEATDTE 

YSEIKIHR 

Signal peptide: 

amino acids 1-15 

Transmembrane domain: 

amino acids 351-370 
FIGURE 87

AGAAAGCTGCACTCTGTTGAGCTCCAGGGCGCAGTGGAGGGAGGGAGTGAAGGAGCTCTCTGTAC 
CCAAGGAAAG T GCAGC TGAGACT CAGACAAGAT TACAATGAAC CAAC TCAGCT T CCTGCTGTTTC 
TCATAGCGACCACCAGAGGATGGAGTACAGATGAGGCTAATACTTACTTCAAGGAATGGACCTGT 
TCTTCGTCTCCATCTCTGCCCAGAAGCTGCAAGGAAATCAAAGACGAATGTCCTAGTGCATTTGA 
TGGCCTGTATTTTCTCCGCACTGAGAATGGTGTTATCTACCAGACCTTCTGTGACATGAGCTCTG 
GGGGTGGCGGCTGGACCCTGGTGGCCAGCGTGCATGAGAATGACATGCGTGGGAAGTGCACGGTG 
GGCGATCGCTGGTCCAGTCAGCAGGGCAGCAAAGCAGACTACCCAGAGGGGGACGGCAACTGGGC
CAAC TACAACACCT T T GGAT CT GC AGAGGCGGCCACGAGCGAT GACTACAAGAACCC T GGC T AC T
ACGACATCCAGGCCAAGGACCT GGGCATC TGGCACGT GCCCAATAAGTCCCCCAT GCAGCACTGG
AGAAACAGCTCCCTGCTGAGGTACCGCACGGACACTGGCTTCCTCCAGACACTGGGACATAATCT
GTTTGGCATCTACCAGAAATATCCAGTGAAATATGGAGAAGGAAAGTGTTGGACTGACAACGGCC
CGGTGATCCCTGTGGTCTATGATTTTGGCGACGCCCAGAAAACAGCATCTTATTACTCACCCTAT
GGCCAGCGGGAATTCACTGCGGGATTTGTTCAGTTCAGGGTATTTAATAACGAGAGAGCAGCCAA
CGCCTTGTGTGCTGGAATGAGGGTCACCGGATGTAACACTGAGCATCACTGCATTGGTGGAGGAG
GATACTTTCCAGAGGCCAGTCCCCAGCAGTGTGGAGATTTTTCTGGTTTTGATTGGAGTGGATAT
GGAACTCAT G T T GG T T AC AGCAGC AGCCG T GAGATAACT GAGGCAGC T GT GC T T C TAT T CT AT CG
TTGAGAGTTTTGTGGGAGGGAACCCAGACCTCTCCTCCCAACCATGAGATCCCAAGGATGGAGAA
CAACT T ACC CAG T AGC TAGAAT G T T AAT GGCAGAAGAGAAAAC AATAAATCAT AT T GAC T CAAGA

AAAAAA 
FIGURE 88



MNQLSFLLFLIATTRGWSTDEANTYFKEWTCSSSPSLPRSCKEIKDECPSAFDGLYFLRTENGVI 

YQTFCDMTSGGGGWTLVASVHENDMRGKCTVGDRWSSQQGSKADYPEGDGNWANYNTFGSAEAAT

SDDYKNPGYYDIQAKDLGIWHVPNKSPMQHWRNSSLLRYRTDTGFLQTLGHNLFGIYQKYPVKYG 
EGKCWTDNGPVIPVVYDFGDAQKTASYYSPYGQREFTAGFVQFRVFNNERAANALCAGMRVTGCN
TEHHCIGGGGYFPEASPQQCGDFSGFDWSGYGTHVGYSSSREITEAAVLLFYR 

Important features: 
Signal peptide: 



amino acids 1-16 



N-glycosylation site. 



amino acids 163-167 



Glycosaminoglycan attachment sites. 



amino acids 74-78, 289-293






N-myristoylation sites. 

amino acids 76-82, 115-121, 124-130, 253-259, 292-298 
FIGURE 89



CTAGATTTGTCGGCTTGCGGGGAGACTTCAGGAGTCGCTGTCTCTGAACTTCCAGCCTCAGAGAC 
CGCCGCCCTTGTCCCCGAGGGCCATGGGCCGGGTCTCAGGGCTTGTGCCCTCTCGCTTCCTGACG 
CTCCTGGCGCATCTGGTGGTCGTCATCACCTTATTCTGGTCCCGGGACAGCAACATACAGGCCTG 
CCTGCCTCTCACGTTCACCCCCGAGGAGTATGACAAGCAGGACATTCAGCTGGTGGCCGCGCTCT 
CTGTCACCCTGGGCCTCTTTGCAGTGGAGCTGGCCGGTTTCCTCTCAGGAGTCTCCATGTTCAAC 
AGCACCCAGAGCCTCATCTCCATTGGGGCTCACTGTAGTGCATCCGTGGCCCTGTCCTTCTTCAT 
ATTCGAGCGTTGGGAGTGCACTACGTATTGGTACATTTTTGTCTTCTGCAGTGCCCTTCCAGCTG 
TCACT GAAATGGCT T TAT TC GT CACC G T C T T TGGGCT GAAAAAGAAAC CCT TCTGAT T ACC T TCA 
TGACGGGAACCTAAGGACGAAGCCTACAGGGGCAAGGGCCGCTTCGTATTCCTGGAAGAAGGAAG 
GCATAGGCTTCGGTTTTCCCCTCGGAAACTGCTTCTGCTGGAGGATATGTGTTGGAATAATTACG 
TCTTGAGTCTGGGATTATCCGCATTGTATTTAGTGCTTTGTAATAAAATATGTTTTGTAGTAACA 
T TAAGAC T T ATATACAGT T T TAGGGGACAAT T AAAAAAAAAAAA 
FIGURE 90

MGRVSGLVPSRFLTLLAHLVVVITLFWSRDSNIQACLPLTFTPEEYDKQDIQLVAALSVTLGLFA
VELAGFLSGVSMFNSTQSLISIGAHCSASVALSFFIFERWECTTYWYIFVFCSALPAVTEMALFV
TVFGLKKKPF 



Transmembrane domain: 

amino acids 12-28 (type II), 51-66, 107-124 

FIGURE 91

C T G G G AC C C C GAAAAGAGAAG G G GAG AG C GAG G G G AC GAG AG C G GAGGAG GAAGATGC AAC T G AC 
TCGCTGCTGCTTCGTGTTCCTGGTGCAGGGTAGCCTCTATCTGGTCATCTGTGGCCAGGATGATG 
GTCCTCCCGGCTCAGAGGACCCTGAGCGTGATGACCACGAGGGCCAGCCCCGGCCCCGGGTGCCT 
CGGAAGCGGGGCCACATCTCACCTAAGTCCCGCCCCATGGCCAATTCCACTCTCCTAGGGCTGCT 
GGCCCCGCCTGGGGAGGCTTGGGGCATTCTTGGGCAGCCCCCCAACCGCCCGAACCACAGCCCCC 
CACCCTCAGCCAAGGTGAAGAAAATCTTTGGCTGGGGCGACTTCTACTCCAACATCAAGACGGTG 
GCCCTGAACCTGCTCGTCACAGGGAAGATTGTGGACCATGGCAATGGGACCTTCAGCGTCCACTT 
CCAACACAATGCCACAGGCCAGGGAAACATCTCCATCAGCCTCGTGCCCCCCAGTAAAGCTGTAG 
AGTTCCACCAGGAACAGCAGATCTTCATCGAAGCCAAGGCCTCCAAAATCTTCAACTGCCGGATG 
GAGT GGGAGAAGGTAGAACGGGGCCGCCGGACC T CGC T TT GCACCCACGACCCAGCCAAGAT C T G 
CTCCCGAGACCACGCTCAGAGCTCAGCCACCTGGAGCTGCTCCCAGCCCTTCAAAGTCGTCTGTG 
TCTACATGGCCTTCTACAGCACGGACTATCGGCTGGTCCAGAAGGTGTGCCCAGATTACAACTAC 
CATAGTGATACCCCCTACTACCCATCTGGGTGACCCGGGGCAGGCCACAGAGGCCAGGCCAGGGC 
TGGAAGGACAGGCCTGCCCATGCAGGAGACCATCTGGACACCGGGCAGGGAAGGGGTTGGGCCTC 
AGGCAGGGAGGGGGGTGGAGACGAGGAGATGCCAAGTGGGGCCAGGGCCAAGTCTCAAGTGGCAG 
AGAAAGGGT CCCAAGTGC TGGT CCCAACC TGAAGC TGTGGAGT GAC TAGATCACAGGAGCACTGG 
AGGAGGAGTGGGCTCTCTGTGCAGCCTCACAGGGCTTTGCCACGGAGCCACAGAGAGATGCTGGG 
TCCCCGAGGCCTGTGGGCAGGCCGATCAGTGTGGCCCCAGATCAAGTCATGGGAGGAAGCTAAGC 
CCT TGGT TCT TGCCATCCTGAGGAAAGATAGCAACAGGGAGGGGGAGAT T TCATCAGTGT GGACA 
GCCTGTCAACTTAGGATGGATGGCTGAGAGGGCTTCCTAGGAGCCAGTCAGCAGGGTGGGGTGGG 
GCCAGAGGAGCTCTCCAGCCCTGCCTAGTGGGCGCCCTGAGCCCCTTGTCGTGTGCTGAGCATGG 
CAT GAGGC T GAAGT GGCAAC CC T GGGGTC T T T GAT G T C T TGACAGAT T GACCAT C T G T C T CCAGC 
CAGGCCACCCCTTTCCAAAATTCCCTCTTCTGCCAGTACTCCCCCTGTACCACCCATTGCTGATG 
GCACACCCATCCTTAAGCTAAGACAGGACGATTGTGGTCCTCCCACACTAAGGCCACAGCCCATC 
CGCGTGCTGTGTGTCCCTCTTCCACCCCAACCCCTGCTGGCTCCTCTGGGAGCATCCATGTCCCG 
GAGAGGGGTCCCTCAACAGTCAGCCTCACCTGTCAGACCGGGGTTCTCCCGGATCTGGATGGCGC 
CGCCCTCTCAGCAGCGGGCACGGGTGGGGCGGGGCCGGGCCGCAGAGCATGTGCTGGATCTGTTC 
TGTGTGTCTGTCTGTGGGTGGGGGGAGGGGAGGGAAGTCTTGTGAAACCGCTGATTGCTGACTTT 
TGTGTGAAGAATCGTGTTCTTGGAGCAGGAAATAAAGCTTGCCCCGGGGCA 
FIGURE 92

MQLTRCCFVFLVQGSLYLVICGQDDGPPGSEDPERDDHEGQPRPRVPRKRGHISPKSRPMANSTL
LGLLAPPGEAWGILGQPPNRPNHSPPPSAKVKKIFGWGDFYSNIKTVALNLLVTGKIVDHGNGTF
SVHFQHNATGQGNISISLVPPSKAVEFHQEQQIFIEAKASKIFNCRMEWEKVERGRRTSLCTHDP
AKICSRDHAQSSATWSCSQPFKVVCVYIAFYSTDYRLVQKVCPDYNYHSDTPYYPSG

Important features of the protein: 
Signal peptide: 

amino acids 1-14 

N-glycosylation sites. 

amino acids 62-65, 127-130, 137-140, 143-146 

2-oxo acid dehydrogenases acyl transferase 

amino acids 61-71 
FIGURE 93



CGGTGGCCATGACTGCGGCCGTGTTCTTCGGCTGCGCCTTCATTGCCTTCGGGCCTGCGCTCGCC
CTTTATGTCTTCACCATCGCCATCGAGCCGTTGCGTATCATCTTCCTCATCGCCGGAGCTTTCTT 
CTGGTTGGTGTCTCTACTGATTTCGTCCCTTGTTTGGTTCATGGCAAGAGTCATTATTGACAACA 
AAG AT G G AC C AAC AC AGAAAT AT C T G C T GAT C T T T G GAG CGTTTGTCTCTGTC TAT AT CC AAG AA 
ATGTTCCGATTTGCATATTATAAACTCTTAAAAAAAGCCAGTGAAGGTTTGAAGAGTATAAACCC
AGGTGAGACAGCACCCTCTATGCGACTGCTGGCCTATGTTTCTGGCTTGGGCTTTGGAATCATGA 
GTGGAGTATTTTCCTTTGTGAATACCCTATCTGACTCCTTGGGGCCAGGCACAGTGGGCATTCAT
GGAGATTCTCCTCAATTCTTCCTTTATTCAGCTTTCATGACGCTGGTCATTATCTTGCTGCATGT
ATTCTGGGGCATTGTATTTTTTGATGGCTGTGAGAAGAAAAAGTGGGGCATCCTCCTTATCGTTC
TCCT GAC CCACC TGCTGG TGTCAGCCCAGACC T TCAT AAG TT C T TAT TATGGAATAAACCT GGCG
TCAGCATTTATAATCCTGGTGCTCATGGGCACCTGGGCATTCTTAGCTGCGGGAGGCAGCTGCCG
AAGCCTGAAACTCTGCCTGCTCTGCCAAGACAAGAACTTTCTTCTTTACAACCAGCGCTCCAGAT
AACCT CAGGGAACCAGCAC T T CCCAAACCGCAGAC TACAT C T T TAGAGGAAGCACAACTGTGCC T
TTTTCTGAAAATCCCTTTTTCTGGTGGAATTGAGAAAGAAATAAAACTATGCAGATA
FIGURE 94



MTAAVFFGCAFIAFGPALALYVFTIAIEPLRIIFLIAGAFFWLVSLLISSLVWFMARVIIDNKDG 
PTQKYLLIFGAFVSVYIQEMFRFAYYKLLKKASEGLKSINPGETAPSMRLLAYVSGLGFGIMSGV 
FSFVNTLSDSLGPGTVGIHGDSPQFFLYSAFMTLVIILLHVFWGIVFFDGCEKKKWGILLIVLLT 
HLLVSAQTFISSYYGINLASAFIILVLMGTWAFLAAGGSCRSLKLCLLCQDKNFLLYNQRSR 




Important features of the protein: 
Signal peptide: 

amino acids 1-19 

Transmembrane domains: 

amino acids 32-51, 119-138, 152-169, 216-235 

Glycosaminoglycan attachment site. 

amino acids 120-123 

Sodium:neurotransmitter symporter family protein

amino acids 31-65 
FIGURE 95



AATTTTTCACCAGAGTAAACTTGAGAAACCAACTGGACCTTGAGTATTGTACATTTTGCCTCGTG 
GACCCAAAGGTAGCAATCTGAAAC ATGA GGAGTACGATTCTACTGTTTTGTCTTCTAGGATCAAC 
TCGGTCATTACCACAGCTCAAACCTGCTTTGGGACTCCCTCCCACAAAACTGGCTCCGGATCAGG 
GAACAC T ACCAAAC CAACAGCAG T CAAAT CAGG TC TTTCCTTCTT TAAGTC TGAT ACCAT TAACA 
CAGATGCTCACACTGGGGCCAGATCTGCATCTGTTAAATCCTGCTGGAGGAATGACACCTGGTAC 
CCAGACC CACCCAT TGACCCT GGGAGGG T TGAATGTACAACAGCAACTGCACCCACATG T G T TAC 
CAATTTTT GTCACACAAC T TGGAGCC CAGGGCACTAT CC T AAGC TCAGAGGAAT T GCCACAAATC
TTCACGAGCCTCATCATCCATTCCTTGTTCCCGGGAGGCATCCTGCCCACCAGTCAGGCAGGGGC
TAATCCAGATGTCCAGGATGGAAGCCTTCCAGCAGGAGGAGCAGGTGTAAATCCTGCCACCCAGG
GAACCCCAGCAGGCCGCCTCCCAACTCCCAGTGGCACAGATGACGACTTTGCAGTGACCACCCCT
GCAGGCAT CCAAAGGAGCACAC AT GC CATCG AGGAAGCC AC CACAGAATCAG CAAAT GGAAT TCA
G TAAG CTGTTTCAAATTTTTTCAACTAAGCTGCCTCGAATTTGGTGATACATGTGAATCTTTATC
AT T GAT TAT AT TAT GGAAT AG AT T GAG AC AC AT T GG ATAG T C T T AG AAGAAAT T AAT T C T T AAT T

TACC T GAAAATAT TCT TGAAAT T TCAGAAAATATGT T CT AT G T AGAGAAT CCCAACT T T T AAAAA 
CAAT AAT T C AATGGATAAAT C T G TC T T TGAAATATAACAT TAT GCTGCCT GGAT GAT AT GCATAT 
TAAAACATATT TGGAAAACTGGAAAAAAAAAAAAAP^^ 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



FIGURE 96

MRSTILLFGLLGSTRSLPQLKPALGLPPTKLAPDQGTLPNQQQSNQVFPSLSLIPLTQM 
LTLGPDLHLLNPAGGMTPGTQTHPLTLGGLNVQQQLHPHVLPIFVTQLGAQGTILSSEE
LPQIFTSLIIHSLFPGGILPTSQAGANPDVQDGSLPAGGAGVNPATQGTPAGRLPTPSG 
TDDDFAVTTPAGIQRSTHAIEEATTESANGIQ 

Signal peptide: 

amino acids 1-16 



FIGURE 97



GCTCAAGTGCCCTGCCTTGCCCCACCCAGCCCAGCCTGGCCAGAGCCCCCTGGAGAAGGAGCTCT 
CTTCTTGCTTGGCAGCTGGACCAAGGGAGCCAGTCTTGGGCGCTGGAGGGCCTGTCCTGACCATG 
GTCCCTGCCTGGCTGTGGCTGCTTTGTGTCTCCGTCCCCCAGGCTCTCCCCAAGGCCCAGCCTGC 
AGAGCTGTCTGTGGAAGTTCCAGAAAACTATGGTGGAAATTTCCCTTTATACCTGACCAAGTTGC 
CGCTGCCCCGTGAGGGGGCTGAAGGCCAGATCGTGCTGTCAGGGGACTCAGGCAAGGCAACTGAG 
GGCCCATTTGCTATGGATCCAGATTCTGGCTTCCTGCTGGTGACCAGGGCCCTGGACCGAGAGGA 
GCAGGCAGAGTACCAGCTACAGGTCACCCTGGAGATGCAGGATGGACATGTCTTGTGGGGTCCAC 
AGCCTGTGCTTGTGCACGTGAAGGATGAGAATGACCAGGTGCCCCATTTCTCTCAAGCCATCTAC 
AGAGCTCGGCTGAGCCGGGGTACCAGGCCTGGCATCCCCTTCCTCTTCCTTGAGGCTTCAGACCG 
GGATGAGCCAGGCACAGCCAACTCGGATCTTCGATTCCACATCCTGAGCCAGGCTCCAGCCCAGC
CTTCCCCAGACATGTTCCAGCTGGAGCCTCGGCTGGGGGCTCTGGCCCTCAGCCCCAAGGGGAGC
ACCAGCCTTGACCACGCCCTGGAGAGGACCTACCAGCTGTTGGTACAGGTCAAGGACATGGGTGA
CCAGGCCTCAGGCCACCAGGCCACTGCCACCGTGGAAGTCTCCATCATAGAGAGCACCTGGGTGT
CCCTAGAGCCTATCCACCTGGCAGAGAATCTCAAAGTCCTATACCCGCACCACATGGCCCAGGTA
CACTGGAGTGGGGGTGATGTGCACTATCACCTGGAGAGCCATCCCCCGGGACCCTTTGAAGTGAA
TGCAGAGGGAAACCTCTACGTGACCAGAGAGCTGGACAGAGAAGCCCAGGCTGAGTACCTGCTCC
AGGTGCGGGCTCAGAATTCCCATGGCGAGGACTATGCGGCCCCTCTGGAGCTGCACGTGCTGGTG
ATGGATGAGAATGACAACGTGCCTATCTGCCCTCCCCGTGACCCCACAGTCAGCATCCCTGAGCT
CAGTCCACCAGGTACTGAAGTGACTAGACTGTCAGCAGAGGATGCAGATGCCCCCGGCTCCCCCA
ATTCCCACGTTGTGTATCAGCTCCTGAGCCCTGAGCCTGAGGATGGGGTAGAGGGGAGAGCCTTC
CAGGTGGACCCCACTTCAGGCAGTGTGACGCTGGGGGTGCTCCCACTCCGAGCAGGCCAGAACAT
CCTGCTTCTGGTGCTGGCCATGGACCTGGCAGGCGCAGAGGGTGGCTTCAGCAGCACGTGTGAAG
TCGAAGTCGCAGTCACAGATATCAATGATCACGCCCCTGAGTTCATCACTTCCCAGATTGGGCCT
ATAAGCCTCCCTGAGGATGTGGAGCCCGGGACTCTGGTGGCCATGCTAACAGCCATTGATGCTGA
CCTCGAGCCCGCCTTCCGCCTCATGGATTTTGCCATTGAGAGGGGAGACACAGAAGGGACTTTTG
GCCTGGATTGGGAGCCAGACTCTGGGCATGTTAGACTCAGACTCTGCAAGAACCTCAGTTATGAG
GCAGCTCCAAGTCATGAGGTGGTGGTGGTGGTGCAGAGTGTGGCGAAGCTGGTGGGGCCAGGCCC
AGGCCCTGGAGCCACCGCCACGGTGACTGTGCTAGTGGAGAGAGTGATGCCACCCCCCAAGTTGG 
ACCAGGAGAGCTACGAGGCCAGTGTCCCCATCAGTGCCCCAGCCGGCTCTTTCCTGCTGACCATC 
CAGCCCTCCGACCCCATCAGCCGAACCCTCAGGTTCTCCCTAGTCAATGACTCAGAGGGCTGGCT 
CTGCATTGAGAAATTCTCCGGGGAGGTGCACACCGCCCAGTCCCTGCAGGGCGCCCAGCCTGGGG 
ACACCTACACGGTGCTTGTGGAGGCCCAGGATACAGCCCTGACTCTTGCCCCTGTGCCCTCCCAA 
TACCTCTGCACACCCCGCCAAGACCATGGCTTGATCGTGAGTGGACCCAGCAAGGACCCCGATCT 
GGCCAGTGGGCACGGTCCCTACAGCTTCACCCTTGGTCCCAACCCCACGGTGCAACGGGATTGGC 
GCCTCCAGACTCTCAATGGTTCCCATGCCTACCTCACCTTGGCCCTGCATTGGGTGGAGCCACGT 
GAACACATAATCCCCGTGGTGGTCAGCCACAATGCCCAGATGTGGCAGCTCCTGGTTCGAGTGAT 
CGTGTGTCGCTGCAACGTGGAGGGGCAGTGCATGCGCAAGGTGGGCCGCATGAAGGGCATGCCCA 
CGAAGCTGTCGGCAGTGGGCATCCTTGTAGGCACCCTGGTAGCAATAGGAATCTTCCTCATCCTC 
ATTTTCACCCACTGGACCATGTCAAGGAAGAAGGACCCGGATCAACCAGCAGACAGCGTGCCCCT
GAAGGCGACTGTCTGAATGGCCCAGGCAGCTCTAGCTGGGAGCTTGGCCTCTGGCTCCATCTGAG
TCCCCTGGGAGAGAGCCCAGCACCCAAGATCCAGCAGGGGACAGGACAGAGTAGAAGCCCCTCCA
TCTGCCCTGGGGTGGAGGCACCATCACCATCACCAGGCATGTCTGCAGAGCCTGGACACCAACTT 
TATGGACTGCCCATGGGAGTGCTCCAAATGTCAGGGTGTTTGCCCAATAATAAAGCCCCAGAGAA 
CTGGGCTGGGCCCTATGGGAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAG 
FIGURE 98



MVPAWLWLLCVSVPQALPKAQPAELSVEVPENYGGNFPLYLTKLPLPREGAEGQIVLSGDSGKAT
EGPFAMDPDSGFLLVTRALDREEQAEYQLQVTLEMQDGHVLWGPQPVLVHVKDENDQVPHFSQAI
YRARLSRGTRPGIPFLFLEASDRDEPGTANSDLRFHILSQAPAQPSPDMFQLEPRLGALALSPKG
STSLDHALERTYQLLVQVKDMGDQASGHQATATVEVSIIESTWVSLEPIHLAENLKVLYPHHMAQ
VHWSGGDVHYHLESHPPGPFEVNAEGNLYVTRELDREAQAEYLLQVRAQNSHGEDYAAPLELHVL
VMDENDNVPICPPRDPTVSIPELSPPGTEVTRLSAEDADAPGSPNSHVVYQLLSPEPEDGVEGRA
FQVDPTSGSVTLGVLPLRAGQNILLLVLAMDLAGAEGGFSSTCEVEVAVTDINDHAPEFITSQIG
PISLPEDVEPGTLVAMLTAIDADLEPAFRLMDFAIERGDTEGTFGLDWEPDSGHVRLRLCKNLSY
EAAPSHEVVVVVQSVAKLVGPGPGPGATATVTVLVERVMPPPKLDQESYEASVPISAPAGSFLLT
IQPSDPISRTLRFSLVNDSEGWLCIEKFSGEVHTAQSLQGAQPGDTYTVLVEAQDTALTLAPVPS
QYLCTPRQDHGLIVSGPSKDPDLASGHGPYSFTLGPNPTVQRDWRLQTLNGSHAYLTLALHWVEP
REHIIPVVVSHNAQMWQLLVRVIVCRCNVEGQCMRKVGRMKGMPTKLSAVGILVGTLVAIGIFLI
LIFTHWTMSRKKDPDQPADSVPLKATV

Signal peptide:

amino acids 1-18

Transmembrane domain:

amino acids 762-784
FIGURE 99



GGCTGACCGTGCTACATTGCCTGGAGGAAGCCTAAGGAACCCAGGCATCCAGCTGCCCACGCCTG 
AGTCCAAGATTCTTCCCAGGAACACAAACGTAGGAGACCCACGCTCCTGGAAGCACCAGCCTTTA 
TCTCTTCACCTTCAAGTCCCCTTTCTCAAGAATCCTCTGTTCTTTGCCCTCTAAAGTCTTGGTAC 
ATCT AGGACCCAGGCAT C T T GC T T TCCAGCCACAAAGAGACAGATGAAGAT GCAGAAAGGAAAT G 
TTCTCCTTATGTTTGGTCTACTATTGCATTTAGAAGCTGCAACAAATTCCAATGAGACTAGCACC 
TCTGCCAACACTGGATCCAGTGTGATCTCCAGTGGAGCCAGCACAGCCACCAACTCTGGGTCCAG 
TGTGACCTCCAGTGGGGTCAGCACAGCCACCATCTCAGGGTCCAGCGTGACCTCCAATGGGGTCA 
GCATAG T CACCAAC T CT GAGT T CCAT ACAACC T CC AGTGGGAT CAGCACAGCCACCAACT C T GAG 
TTCAGCACAGCGTCCAGTGGGATCAGCATAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGG 
GGCCAGCACAGCCACCAACTCTGAGTCCAGCACACCCTCCAGTGGGGCCAGCACAGTCACCAACT 
CTGGGTCCAGTGTGACCTCCAGTGGAGCCAGCACTGCCACCAACTCTGAGTCCAGCACAGTGTCC
AGTAGGGCCAGCACTGCCACCAACTCTGAGTCTAGCACACTCTCCAGTGGGGCCAGCACAGCCAC
CAACTCTGACTCCAGCACAACCTCCAGTGGGGCTAGCACAGCCACCAACTCTGAGTCCAGCACAA
CCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACAGTGTCCAGTAGGGCCAGCACT
GCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAG
AACGACCTCCAATGGGGCTGGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGGGGCCA
GCACAGCCACCAACTCTGACTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAG
TCCAGCACGACCTCCAGTGGGGCCAGCACAGCCACCAACTCTGAGTCCAGCACGACCTCCAGTGG
GGCTAGCACAGCCACCAACTCTGACTCCAGCACAACCTCCAGTGGGGCCGGCACAGCCACCAACT
CTGAGTCCAGCACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACACCCTCC
AGTGGGGCCAACACAGCCACCAACTCTGAGTCCAGTACGACCTCCAGTGGGGCCAACACAGCCAC
CAACTCTGAGTCCAGCACAGTGTCCAGTGGGGCCAGCACTGCCACCAACTCTGAGTCCAGCACAA
CCTCCAGTGGGGTCAGCACAGCCACCAACTCTGAGTCCAGCACAACCTCCAGTGGGGCTAGCACA
GCCACCAACTCTGACTCCAGCACAACCTCCAGTGAGGCCAGCACAGCCACCAACTCTGAGTCTAG
CACAGTGTCCAGTGGGATCAGCACAGTCACCAATTCTGAGTCCAGCACAACCTCCAGTGGGGCCA
ACACAGCCACCAACTCTGGGTCCAGTGTGACCTCTGCAGGCTCTGGAACAGCAGCTCTGACTGGA
ATGCACACAACTTCCCATAGTGCATCTACTGCAGTGAGTGAGGCAAAGCCTGGTGGGTCCCTGGT
GCCGTGGGAAATCTTCCTCATCACCCTGGTCTCGGTTGTGGCGGCCGTGGGGCTCTTTGCTGGGC 
TCTTCTTCTGTGTGAGAAACAGCCTGTCCCTGAGAAACACCTTTAACACAGCTGTCTACCACCCT 
CATGGCCTCAACCATGGCCTTGGTCCAGGCCCTGGAGGGAATCATGGAGCCCCCCACAGGCCCAG 
G TGGAG TCC TAACTGGT T C T GGAGGAGAC CAGTATCAT CGAT AGCCATGGAGAT GAGC GGGAGGA 
ACAGCGGGCCCTGAGCAGCCCCGGAAGCAAGTGCCGCATTCTTCAGGAAGGAAGAGACCTGGGCA 
CCCAAGACCTGGTTTCCTTTCATTCATCCCAGGAGACCCCTCCCAGCTTTGTTTGAGATCCTGAA 
AAT CT TGAAGAAGGTAT T CCT CACCT TTC T TGCCT T TACCAGACAC T GGAAAGAGAATACTATAT 
T G C T CAT T T AGC T AAG AAAT AAAT AC AT C T CAT C T AAC AC AC AC G AC AAAGAG AAG CTGTGCTTG 
CCCCGGGGTGGGTATCTAGCTCTGAGATGAACTCAGTTATAGGAGAAAACCTCCATGCTGGACTC 
CAT CT GGCATT CAAAAT C T CCACAGT AAAATCC AAAGAC CT CAAAAAAAAAAAAAAAAAAAAAAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 



AppJD=10063602 






FIGURE 100 



MKMQKGNVLLMFGLLLHLEAATNSNETSTSANTGSSVISSGASTATNSGSSVTSSGVSTATISGS
SVTSNGVSIVTNSEFHTTSSGISTATNSEFSTASSGISIATNSESSTTSSGASTATNSESSTPSS 
GASTVTNSGSSVTSSGASTATNSESSTVSSRASTATNSESSTLSSGASTATNSDSSTTSSGASTA 
TNSESSTTSSGASTATNSESSTVSSRASTATNSESSTTSSGASTATNSESRTTSNGAGTATNSES 
STTSSGASTATNSDSSTVSSGASTATNSESSTTSSGASTATNSESSTTSSGASTATNSDSSTTSS 
GAGTATNSESSTVSSGISTVTNSESSTPSSGANTATNSESSTTSSGANTATNSESSTVSSGASTA 
TNSESSTTSSGVSTATNSESSTTSSGASTATNSDSSTTSSEASTATNSESSTVSSGISTVTNSES
STTSSGANTATNSGSSVTSAGSGTAALTGMHTTSHSASTAVSEAKPGGSLVPWEIFLITLVSVVA
AVGLFAGLFFCVRNSLSLRNTFNTAVYHPHGLNHGLGPGPGGNHGAPHRPRWSPNWFWRRPVSSI
AMEMSGRNSGP 

Signal peptide: 

amino acids 1-20 

Transmembrane domain: 

amino acids 510-532 
FIGURE 101

GGCCGGACGCCTCCGCGTTACGGGATGAATTAACGGCGGGTTCCGCACGGAGGTTGTGACCCCTA 
CGGAGCCCCAGCTTGCCCACGCACCCCACTCGGCGTCGCGCGGCGTGCCCTGCTTGTCACAGGTG 
GGAGGCTGGAACTATCAGGCTGAAAAACAGAGTGGGTACTCTCTTCTGGGAAGCTGGCAACAAAT 
GGATGATGTGATATATGCATTCCAGGGGAAGGGAAATTGTGGTGCTTCTGAACCCATGGTCAATT 
AACGAGGGAG T T TC TAGC TACT GCAC GTACT TCATAAAGCAGGAC T C TAAAAGC T T T GGAAT CAT 
GGTGTCATGGAAAGGGATTTACTTTATACTGACTCTGTTTTGGGGAAGCTTTTTTGGAAGCATTT 
TCATGCTGAGTCCCTTTTTACCTTTGATGTTTGTAAACCCATCTTGGTATCGCTGGATCAACAAC 
CGCCTTGTGGCAACATGGCTCACCCTACCTGTGGCATTATTGGAGACCATGTTTGGTGTAAAAGT 
GAT TATAACTGGGGATGCATT T GTTCCTGGAGAAAGAAGTGTCATTATCATGAACCAT CGGACAA 
GAATGGACTGGATGTTCCTGTGGAATTGCCTGATGCGATATAGCTACCTCAGATTGGAGAAAATT 
TGCCTCAAAGCGAGTCTCAAAGGTGTTCCTGGATTTGGTTGGGCCATGCAGGCTGCTGCCTATAT 
CTTCATTCATAGGAAATGGAAGGATGACAAGAGCCATTTCGAAGACATGATTGATTACTTTTGTG 
ATAT TCACGAACCACT TCAACT CCTCATATTCCCAGT^AGGGACTGATCTCACAGAAAACAGCAAG 
T C T C G AAG T AAT GC AT T T G C T G AAAAAAAT GGAC T T C AG AAAT AT GAAT AT G T T T T AC AT C CAAG 
AACTACAGGCT T TACT TT T GTGG T AGACC GTC TAAGAGAAGGT AAGAACC T TGAT GCT GT CCAT G 
ATATCACTGTGGCGTATCCTCACAACATTCCTCAATCAGAGAAGCACCTCCTCCAAGGAGACTTT 
CCCAGGGAAATCCACTTTCACGTCCACCGGTATCCAATAGACACCCTCCCCACATCCAAGGAGGA 
CCTTCAACTCTGGTGCCACAAACGGTGGGAAGAGAAAGAAGAGAGGCTGCGTTCCTTCTATCAAG 
GGGAGAAGAAT TT TTAT TT TACCGGACAGAG TGTCAT TCCACCT T GCAAGTCTGAACT CAGGGTC 
CTTGTGGTCAAATTGCTCTCTATACTGTATTGGACCCTGTTCAGCCCTGCAATGTGCCTACTCAT 
ATATTTGTACAGTCTTGTTAAGTGGTATTTTATAATCACCATTGTAATCTTTGTGCTGCAAGAGA 
GAATATT TGGTGGACTGGAGATCATAGAACT TGCATGTTACCGACTT TTACACAAACAGCCACAT 
T TAAAT T CAAAGAAAAAT GAGTAAGAT TATAAGGT T T GCCAT GT GAAAAC C TAGAGCATAT T T TG 
GAAATGTTCTAAACCTTTCTAAGCTCAGATGCATTTTTGCATGACTATGTCGAATATTTCTTACT 
GCCATCATTATTTGTTAAAGATATTTTGCACTTAATTTTGTGGGAAAAATATTGCTACAATTTTT 
T T T AAT C T C T GAAT G T AAT T T C GAT AC T G T G T AC AT AG C AG G GAG T GAT C GG G G T G AAAT AAC T T 
GGGCCAGAATAT TAT TAAAC AATC AT CAG GC T T T TAAA 



FIGURE 102 

MHSRGREIVVLLNPWSINEAVSSYCTYFIKQDSKSFGIMVSWKGIYFILTLFWGSFFGSIFMLSP
FLPLMFVNPSWYRWINNRLVATWLTLPVALLETMFGVKVIITGDAFVPGERSVIIMNHRTRMDWM
FLWNCLMRYSYLRLEKICLKASLKGVPGFGWAMQAAAYIFIHRKWKDDKSHFEDMIDYFCDIHEP
LQLLIFPEGTDLTENSKSRSNAFAEKNGLQKYEYVLHPRTTGFTFVVDRLREGKNLDAVHDITVA
YPHMIPQSEKHLLQGDFPREIHFHVHRYPIDTLPTSKEDLQLWCHKRWEEKEERLRSFYQGEKNF
YFTGQSVIPPCKSELRVLVVKLLSILYWTLFSPAMCLLIYLYSLVKWYFIITIVIFVLQERIFGG
LEIIELACYRLLHKQPHLNSKKNE

Important features of the protein:

Signal peptide:

amino acids 1-22

Transmembrane domains:

amino acids 44-63, 90-108, 354-377
FIGURE 103



CGGCTCGAGCGGCTCGAGTGAAGAGCCTCTCCACGGCTCCTGCGCCTGAGACAGCTGGCCTGACC 
TCCAAATCATCCATCCACCCCTGCTGTCATCTGTTTTCATAGTGTGAGATCAACCCACAGGAATA 
TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT 
CACTGGACCGGGCAAGTTTGTCCAGGCCTTGGTGGGGGAGGACGCCGTGTTCTCCTGCTCCCTCT 
TTCCTGAGACCAGTGCAGAGGCTATGGAAGTGCGGTTCTTCAGGAATCAGTTCCATGCTGTGGTC 
CACCTCTACAGAGATGGGGAAGACTGGGAATCTAAGCAGATGCCACAGTATCGAGGGAGAACTGA
GTTTGTGAAGGACTCCATTGCAGGGGGGCGTGTCTCTCTAAGGCTAAAAAACATCACTCCCTCGG
ACATCGGCCTGTATGGGTGCTGGTTCAGTTCCCAGATTTACGATGAGGAGGCCACCTGGGAGCTG
CGGGTGGCAGCACTGGGCTCACTTCCTCTCATTTCCATCGTGGGATATGTTGACGGAGGTATCCA
GTTACTCTGCCTGTCCTCAGGCTGGTTCCCCCAGCCCACAGCCAAGTGGAAAGGTCCACAAGGAC
AGGATTTGTCTTCAGACTCCAGAGCAAATGCAGATGGGTACAGCCTGTATGATGTGGAGATCTCC
ATTATAGTCCAGGAAAATGCTGGGAGCATATTGTGTTCCATCCACCTTGCTGAGCAGAGTCATGA
GGTGGAATCCAAGGTATTGATAGGAGAGACGTTTTTCCAGCCCTCACCTTGGCGCCTGGCTTCTA
TTTTACTCGGGTTACTCTGTGGTGCCCTGTGTGGTGTTGTCATGGGGATGATAATTGTTTTCTTC
AAATCCAAAGGGAAAATCCAGGCGGAACTGGACTGGAGAAGAAAGCACGGACAGGCAGAATTGAG
AGACGCCCGGAAACACGCAGTGGAGGTGACTCTGGATCCAGAGACGGCTCACCCGAAGCTCTGCG
TTTCTGATCTGAAAACTGTAACCCATAGAAAAGCTCCCCAGGAGGTGCCTCACTCTGAGAAGAGA
TTTACAAGGAAGAGTGTGGTGGCTTCTCAGGGTTTCCAAGCAGGGAGACATTACTGGGAGGTGGA
CGTGGGACAAAATGTAGGGTGGTATGTGGGAGTGTGTCGGGATGACGTAGACAGGGGGAAGAACA
ATGTGACTTTGTCTCCCAACAATGGGTATTGGGTCCTCAGACTGACAACAGAACATTTGTATTTC
ACATTCAATCCCCATTTTATCAGCCTCCCCCCCAGCACCCCTCCTACACGAGTAGGGGTCTTCCT
GGACTATGAGGGTGGGACCATCTCCTTCTTCAATACAAATGACCAGTCCCTTATTTATACCCTGC
TGACATGTCAGTTTGAAGGCTTGTTGAGACCCTATATCCAGCATGCGATGTATGACGAGGAAAAG
GGGACTCCCATATTCATATGTCCAGTGTCCTGGGG ATGA GACAGAGAAGACCCTGCTTAAAGGGC 
CCCACACCACAGACCCAGACACAGCCAAGGGAGAGTGCTCCCGACAGGTGGCCCCAGCTTCCTCT
CCGGAGCCTGCGCACAGAGAGTCACGCCCCCCACTCTCCTTTAGGGAGCTGAGGTTCTTCTGCCC
TGAGCCCTGCAGCAGCGGCAGTCACAGCTTCCAGATGAGGGGGGATTGGCCTGACCCTGTGGGAG
T CAGAAGCCAT GGC T GCCC T GAAGTGGGGAC GGAATAGAC T CACAT TAGG T T TAGT T T G T GAAAA 
CTCCATCCAGCTAAGCGATCTTGAACAAGTCACAACCTCCCAGGCTCCTCATTTGCTAGTCACGG 
ACAG T GAT T CC TGCCT CACAG GT GAAGAT TAAAGAGACAACGAAT G TGAATC ATGCT T GCAGG T T 
TGAGGGCACAGTGTTTGCTAATGATGTGTTTTTATATTATACATTTTCCCACCATAAACTCTGTT 
TGCT TATTCCACAT TAAT T T AC T T T T C TC TATACCAAAT CACC CAT GGAATAGT TAT T GAACACC 
TGCT T TGTGAGGCTCAAAGAATAAAGAGGAGGTAGGAT TT TTCACTGATTCTATAAGCCCAGCAT 
TACCTGATACCAAAACCAGGCAAAGAAAACAGAAGAAGAGGAAGGAAAACTACAGGTCCATATCC 
C T CAT T AAC ACAGAC AC AAAAAT T C T AAAT AAAAT T T T AAC AAAT T AAAC TAAACAAT AT AT T TA 
AAGATGATATATAACTACTCAGTGTGGTTTGTCCCACAAATGCAGAGTTGGTTTAATATTTAAAT 
AT CAACC AGTGTAAT T CAGCAC AT TAATAAAGT AAAAAAGAAAACGATAAAAAAAAAAAAAAA 
FIGURE 104

MAFVLILVLSFYELVSGQWQVTGPGKFVQALVGEDAVFSCSLFPETSAEAMEVRFFRNQFHAVVH
LYRDGEDWESKQMPQYRGRTEFVKDSIAGGRVSLRLKNITPSDIGLYGCWFSSQIYDEEATWELR
VAALGSLPLISIVGYVDGGIQLLCLSSGWFPQPTAKWKGPQGQDLSSDSRANADGYSLYDVEISI
IVQENAGSILCSIHLAEQSHEVESKVLIGETFFQPSPWRLASILLGLLCGALCGVVMGMIIVFFK
SKGKIQAELDWRRKHGQAELRDARKHAVEVTLDPETAHPKLCVSDLKTVTHRKAPQEVPHSEKRF
TRKSVVASQGFQAGRHYWEVDVGQNVGWYVGVCRDDVDRGKNNVTLSPNNGYWVLRLTTEHLYFT
FNPHFISLPPSTPPTRVGVFLDYEGGTISFFNTNDQSLIYTLLTCQFEGLLRPYIQHAMYDEEKG 
TPIFICPVSWG 

Signal peptide: 

amino acids 1-17 

Transmembrane domains: 

amino acids 131-150, 235-259 
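
The amino acid sequence above can be cross-checked against the cDNA figure that precedes it by translating the open reading frame. The following is a minimal Python sketch, assuming the standard genetic code and using only the single 65-nucleotide line that contains the first ATG. The names `translate`, `NUCLEOTIDE_LINE` and `PROTEIN_START` are illustrative and do not appear in the original.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the patent sequences use the standard code; "*" marks a stop codon).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate from the first ATG until a stop codon or the end of input."""
    dna = "".join(dna.split()).upper()  # drop OCR spacing and line breaks
    start = dna.find("ATG")
    if start < 0:
        return ""
    protein = []
    for i in range(start, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" for unreadable codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Line of the cDNA listing that contains the initiating ATG.
NUCLEOTIDE_LINE = (
    "TCCATGGCTTTTGTGCTCATTTTGGTTCTCAGTTTCTACGAGCTGGTGTCAGGACAGTGGCAAGT"
)
# First residues of the amino acid listing.
PROTEIN_START = "MAFVLILVLSFYELVSGQWQ"

if __name__ == "__main__":
    print(translate(NUCLEOTIDE_LINE) == PROTEIN_START)
```

Applied line by line, the same comparison can help flag positions where the scanned protein listing and the scanned cDNA listing disagree.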
FIGURE 105



CCTTCACAGGACTCTTCATTGCTGGTTGGCA ATGA TGTATCGGCCAGATGTGGTGAGGGCTAGGAAAAGAG
TTTGTTGGGAACCCTGGGTTATCGGCCTCGTCATCTTCATATCCCTGATTGTCCTGGCAGTGTGCATTGGA 
CT CACTGTT CAT TAT GTG AG AT AT AATC AAAAGAAG AC CT AC AAT T ACT AT AGC AC AT TGTC AT T T AC AAC 
TGAC AAACTATATGCTG AGT T T GGC AG AG AGG CT TCT AACAATT TT AC AG AAAT G AGCCAG AGACT T G AAT 
CAATGGTGAAAAATGCATTTTATAAATCTCCATTAAGGGAAGAATTTGTCAAGTCTCAGGTTATCAAGTTC 
AGTCAACAG AAGCAT GGAGT GT TG GC TCAT AT GCT GTTGATT TGT AGATT T CACTC T AC TGAGG AT CCTG A 
AACTGTAGATAAAATTGTTCAACTTGTTTTACATGAAAAGCTGCAAGATGCTGTAGGACCCCCTAAAGTAG 
ATCCTCACTCAGTTAAAATTAAAAAAATCAACAAGACAGAAACAGACAGCTATCTAAACCATTGCTGCGGA
ACACGAAGAAGTAAAACTCTAGGTCAGAGTCTCAGGATCGTTGGTGGGACAGAAGTAGAAGAGGGTGAATG
GCCCTGGCAGGCTAGCCTGCAGTGGGATGGGAGTCATCGCTGTGGAGCAACCTTAATTAATGCCACATGGC
TTGTGAGTGCTGCTCACTGTTTTACAACATATAAGAACCCTGCCAGATGGACTGCTTCCTTTGGAGTAACA
ATAAAACCTTCGAAAATGAAACGGGGTCTCCGGAGAATAATTGTCCATGAAAAATACAAACACCCATCACA
TGACTATGATATTTCTCTTGCAGAGCTTTCTAGCCCTGTTCCCTACACAAATGCAGTACATAGAGTTTGTC
TCCCTGATGC ATCCT ATG AGTT TCAACCAGGT GATGTGATGTTT GTGACAGGAT TTGGAGCACTGAAAAAT 
GATGGTTACAGTCAAAATCATCTT CGACAAGC ACAGGTGACT CT CATAGACGCT ACAACTTGCAATGAACC 
TCAAGCTTACAATGACGCCATAACTCCTAGAATGTTATGTGCTGGCTCCTTAGAAGGAAAAACAGATGCAT 
GCCAGGGTGACT CT GG AG G ACC AC TG GT T AGT T C AG ATGCT AGAGAT ATC T GGT ACCT TG CT GGAAT AGTG 
AGCT GGGGAGAT GAATGT GCG AAACC CAACAAGCCT GGTGTT TAT ACT AG AGT T AC GGCCTT GCGGGACTG 
GATTACTTCAAAAACTGGTATCTAAGAGACAAAAGCCTCATGGAACAGATAACATTTTTTTTTGTTTTTTG
GGTGTGGAGGCC ATT T TT AGAGAT AC AG AAT T GG AG AAGACT T GC AAAAC AGCT AG ATTT GACT GATCTC A 
ATAAACTGTTTGCTTGATGCATGTATTTTCTTCCCAGCTCTGTTCCGCACGTAAGCATCCTGCTTCTGCCA 
GATC AACTCT GT CAT C TGTGAGC AAT AGT TGAAACTT T AT GT AC AT AGAG AAAT AG AT AAT ACAAT AT T AC 
ATTACAGCCT GTATTC ATTTGTTCTCTAGAAGTT TTGTCAGAATTTTGACTT GT TG AC AT AAAT T TGT AAT 
GC AT AT AT AC AATT TG AAGC AC TC CTTT T C T T CAGT TCCT CAGCT CCT CT CAT T TC AGC AAAT AT C C ATTT 
TC AAGG TGCAGAACAAGG AGT G AAAG AAAAT ATAAG AAGAAAAAAATCCCCT AC AT TTT ATT GGC AC AGAA 
AAGTATTAGGTGTTTTTCTTAGTGGAATATTAGAAATGATCATATTCATTATGAAAGGTCAAGCAAAGACA 
GCAGAATACCAATCACTTCATCATTTAGGAAGTATGGGAACTAAGTTAAGGAAGTCCAGAAAGAAGCCAAG 
AT AT AT CCTT AT T T T C ATTT CC AAAC AACT ACT ATG AT AAAT GT GAAG AAG AT T CT GTTTTT T T GT GAC CT 
AT AAT AATTAT ACAAACT T CAT GC AATGTACT T GTT CT AAGC AAAT T AAAG C AAAT AT TT AT T T AACAT TG 
TTACTGAGGATGTCAACATATAACAATAAAATATAAATCACCCA 
FIGURE 106

MMYRPDVVRARKRVCWEPWVIGLVIFISLIVIAVCIGLTVHYVRYNQKKTYNYYSTLSFTTDKLY 
AEFGREASNNFTEMSQRLESMVKNAFYKSPLREEFVKSQVIKFSQQKHGVLAHMLLICRFHSTED 
PETVDKIVQLVLHEKLQDAVGPPKVDPHSVKIKKINKTETDSYLNHCCGTRRSKTLGQSLRIVGG 
TEVEEGEWPWQASLQWDGSHRCGATLINATWLVSAAHCFTTYKNPARWTASFGVTIKPSKMKRGL
RRIIVHEKYKHPSHDYDISLAELSSPVPYTNAVHRVCLPDASYEFQPGDVMFVTGFGALKNDGYS
QNHLRQAQVTLIDATTCNEPQAYNDAITPRMLCAGSLEGKTDACQGDSGGPLVSSDARDIWYLAG
IVSWGDECAKPNKPGVYTRVTALRDWITSKTGI

Transmembrane domain:

amino acids 21-40 (type II)
FIGURE 107



AGAGAAAGAAGCGTCTCCAGCTGAAGCCAATGCAGCCCTCCGGCTCTCCGCGAAGAAGTTCCCTG 
CCCCGATGAGCCCCCGCCGTGCGTCCCCGACTATCCCCAGGCGGGCGTGGGGCACCGGGCCCAGC 
GCCGACGATCGCTGCCGTTTTGCCCTTGGGAGTAGGATGTGGTGAAAGGATGGGGCTTCTCCCTT 
ACGGGGCTCACAATGGCCAGAGAAGATTCCGTGAAGTGTCTGCGCTGCCTGCTCTACGCCCTCAA 
TGTGCTCTTTTGGTTAATGTCCATCAGTGTGTTGGCAGTTTCTGCTTGGATGAGGGACTACCTAA 
ATAAT GT T C T CACT T T AAC T G C AG AAACG AG GG T AG AGGAAG C AG T CAT T T T G AC T T AC T T T C C T 
GTGGTTCATCCGGTCATGATTGCTGTTTGCTGTTTCCTTATCATTGTGGGGATGTTAGGATATTG 
TGGAACGGTGAAAAGAAATCTGTTGCTTCTTGCATGGTACTTTGGAAGTTTGCTTGTCATTTTCT 
GTGTAGAACTGGCTTGTGGCGTTTGGACATATGAACAGGAACTTATGGTTCCAGTACAATGGTCA 
GATATGGTCACTTTGAAAGCCAGGATGACAAATTATGGATTACCTAGATATCGGTGGCTTACTCA
TGCTTGGAATTTTTTTCAGAGAGAGTTTAAGTGCTGTGGAGTAGTATATTTCACTGACTGGTTGG
AAATGACAGAGATGGACTGGCCCCCAGATTCCTGCTGTGTTAGAGAATTCCCAGGATGTTCCAAA
CAGGCCCACCAGGAAGATCTCAGTGACCTTTATCAAGAGGGTTGTGGGAAGAAAATGTATTCCTT
TTTGAGAGGAACCAAACAACTGCAGGTGCTGAGGTTTCTGGGAATCTCCATTGGGGTGACACAAA
TCCTGGCCATGATTCTCACCATTACTCTGCTCTGGGCTCTGTATTATGATAGAAGGGAGCCTGGG
ACAGACCAAATGATGTCCTTGAAGAATGACAACTCTCAGCACCTGTCATGTCCCTCAGTAGAACT
GTTGAAACCAAGCCTGTCAAGAATCTTTGAACACACATCCATGGCAAACAGCTTTAATACACACT
TTGAGATGGAGGAGTTATAAAAAGAAATGTCACAGAAGAAAACCACAAACTTGTTTTATTGGACT
TGTGAATTTTTGAGTACATACTATGTGTTTCAGAAATATGTAGAAATAAAAATGTTGCCATAAAA
T AAC AC C T AAGCAT AT AC TAT TCTATGCTT T AAAAT GAG G AT GG AAAAG T T T CAT G T CAT AAG T C 
ACCACCTGGACAATAATTGATGCCCTTAAAATGCTGAAGACAGATGTCATACCCACTGTGTAGCC 
T GT G TAT GACT T T TAC TGAACACAG T TAT GT T T TGAGGCAGCAT GGT T TGAT TAGC AT T TCCGCA 
TCCAT GCAAACGAGTCACAT AT GGTGGGACTGGAGCCATAGTAAAGGTTGAT TTACTTCTACCAA 
CTAGT ATATAAAGTAC TAAT TAAATGCTAACATAGGAAGT TAGAAAATACTAATAACT TTTATTA 
CTCAGCGATCTATTCT TC TGAT GCTAAATAAAT TATATATCAGAAAAC TT TCAATATT GGTGACT 
AC C T AAAT G TGA TTTTTGCTGGT TAC T AAAAT AT T C T TAC C AC T T AAAAG AG CAAGC T AAC AC AT 
TGTCTTAAGCTGATCAGGGATTTTTTGTATATAAGTCTGTGTTAAATCTGTATAATTCAGTCGAT 
TTCAGTTCTGATAATGTTAAGAATAACCATTATGAAAAGGAAAATTTGTCCTGTATAGCATCATT
ATTTTTAGCCTTTCCTGTTAATAAAGCTTTACTATTCTGTCCTGGGCTTATATTACACATATAAC 
TGT TAT T T AAAT AC T T AAC CACT AAT T T TGAAAAT T ACC AGT GT GAT ACATAGGAAT CAT TAT T C 
AG AAT G T AGTCT GGTC T T TAGGAAG T ATTAATAAGAAAAT T T GCACATAAC T TAG T TGAT TCAGA 
AAGGACTTGTATGCTGTTTTTCTCCCAAATGAAGACTCTTTTTGACACTAAACACTTTTTAAAAA 
GCTTATCTTTGCCTTCTCCAAACAAGAAGCAATAGTCTCCAAGTCAATATAAATTCTACAGAAAA
T AGT GTTCTTTTTCTC CAG AAAAAT G C T T G T GAG AAT CAT T AAAAC AT G T G ACAAT T T AGAGAT T 
CTTTGTTT TAT T T C AC T GAT TAAT AT AC T G T G G C AAAT T AC AC AGAT TAT T AAAT T T T T T T ACAA 
GAGTAT AG T ATAT T TAT T T GAAAT GGGAAAAGTGC AT T T TAC T GTAT T T T G T G TAT T T T GT T TAT 
T TCTCAGAATATGGAAAGAAAAT TAAAATGTGTCAATAAATAT TT TCTAGAGAGTAA 
FIGURE 108

MAREDSVKCLRCLLYALNVLFWLMSISVLAVSAWMRDYLNNVLTLTAETRVEEAVILTYFPVVHP
VMIAVCCFLIIVGMLGYCGTVKRNLLLLAWYFGSLLVIFCVELACGVWTYEQELMVPVQWSDMVT
LKARMTNYGLPRYRWLTHAWNFFQREFKCCGVVYFTDWLEMTEMDWPPDSCCVREFPGCSKQAHQ
EDLSDLYQEGCGKKMYSFLRGTKQLQVLRFLGISIGVTQILAMILTITLLWALYYDRREPGTDQM
MSLKNDNSQHLSCPSVELLKPSLSRIFEHTSMANSFNTHFEMEEL



Signal peptide: 

amino acids 1-33 



Transmembrane domains: 

amino acids 12-35, 57-86, 94-114, 226-248
FIGURE 109



CCAAGGCCAGAGCTGTGGACACCTTATCCCACTCATCCTCATCCTCTTCCTCTGATAAAGCCCCTACCAGTGCT 
GAT AAAGT CT T TC TC GT G AGAGC CT AG AG GCCT T AAAAAAAAAAGT GCT T GAAAGAG AAG GGG AC AAAGG AACA 
CCAGTATTAAGAGGATTTTCCAGTGTTTCTGGCAGTTGGTCCAGAAGGATGCCTCCATTCCTGCTTCTCACCTG 
CCTCTTCATCACAGGCACCTCCGTGTCACCCGTGGCCCTAGATCCTTGTTCTGCTTACATCAGCCTGAATGAGC 
CCT GGAG GAAC AC T GAC C AC CAG T T GG AT GAGT CT C AAG GTCCT CCT CT AT GT GAC AAC C AT GT GAAT G GG GAG 
TGGT ACC ACT T CACGGGC AT GGC GGGAGATGCCAT GCCT ACCT T CT GC ATAC CAGAAAACCACT GT GGAACCCA 
CGCACCTGTCTGGCTCAATGGCAGCCACCCCCTAGAAGGCGACGGCATTGTGCAACGCCAGGCTTGTGCCAGCT 
TCAATGGGAACTGCTGTCTCTGGAACACCACGGTGGAAGTCAAGGCTTGCCCTGGAGGCTACTATGTGTATCGT 
CT GAC C AAGC CC AGC GT CT GCT T CC AC GT C T AC T GT GGT CAT T T TT AT GAC ATCT GC GAC GAGGACT GCC AT GG 
C AGCT GCT CAG AT AC C AGC GAGT GC AC AT GCGCT C CAGGAACT GT GC T AGGC CC T GAC AG GC AGACAT GCT T T G 
ATGAAAAT GAAT GTGAGCAAAAC AACGGTGGCTGCAGT GAGAT CTGT GT GAACCTCAAAAACTC CTACC GCT GT 
GAGTGTGGGGTT GGCCGTGT GCT AAGAAGT GAT GGCAAGACTT GTGAAGACGTT GAAGGAT GCC ACAAT AAC AA 
TGGTGGCTGCAGCCACTCTTGCCTTGGATCTGAGAAAGGCTACCAGTGTGAATGTCCCCGGGGCCTGGTGCTGT 
C T GAGGAT AACC AC A CT T GC CAAGT CC C T GT G TT GT GC AAAT CAAAT GCC AT TGAAGT GAAC AT CCCCAGG GAG 
CT GGT T G GT GGC CT GGAGC T CT T CC T GAC C AAC AC CT CC T GCC GAGGAGT GT CC AAC G GC AC CC AT GT C AAC AT 
CCT CT TCT CT CT C AAGACAT GT G GT AC AGT G GT CGAT GT GGT GAAT GAC AAG AT T GT G GC CAGC AACC TC GT GA 
CAGGTCTACCCAAGCAGACCCCGGGGAGCAGCGGGGACTTCAT CATC CGAACC AGC AAGC T GCT GATCCCGGTG 
ACCT GC GAGTT TCCACGCCT GT ACACC ATTTCT GAAGGAT AC GT TCC CAACC TT CGAAACT CCCCACT GGAAAT 
CATGAGCCGAAATCATGGGATCTTCCCATTCACTCTGGAGATCTTCAAGGACAATGAGTTTGAAGAGCCTTACC 
GGGAAGCTCTGCCCACCCTCAAGCTTCGTGACTCCCTCTACTTTGGCATTGAGCCCGTGGTGCACGTGAGCGGC 
TT GGAAAGCT T GGT GGAGAGCT GCTTT GCC ACCCCCACC TCCAAGAT CGACGAGGT CCTGAAATACTACCTCAT 
CCGGGAT GGCT GT GT TT C AGAT GACTC GGT AAAGC AGTACACATCCC GGGAT CACCT AGCAAAGCACT TCCAGG 
TCCCTGTCTTCAAGTTTGTGGGCAAAGACCACAAGGAAGTGTTTCTGCACTGCCGGGTTCTTGTCTGTGGAGTG 
TTGGACGAGCGTTCCCGCTGTGCCCAGGGTTGCCACCGGCGAATGCGTCGTGGGGCAGGAGGAGAGGACTCAGC 
CGGTCTACAGGGCCAGACGCTAACAGGCGGCCCGATCCGCATCGACTGGGAGGACTAGTTCGTAGCCATACCTC 
GAGTCCCTGCATTGGACGGCTCTGCTCTTTGGAGCTTCTCCCCCCACCGCCCTCTAAGAACATCTGCCAACAGC 
TGGGTTCAGACTTCACACTGTGAGTTCAGACTCCCAGCACCAACTCACTCTGATTCTGGTCCATTCAGTGGGCA 
CAGGT CACAGC ACT GCT GAACAAT GTGGCCT GGGT GGGGTTTC ATCT TTCTAGGGT TGAAAACTAAAC TGT CCA 
CCCAGAAAGACACTCACCCCATTTCCCTCATTTCTTTCCTACACTTAAATACCTCGTGTATGGTGCAATCAGAC 
CACAAAATCAGAAGCTGGGT ATAATAT TTCAAGTTACAAACCCTAGAAAAATTAAACAGTTACT GAAATTAT GA 
CTTAAATACCCAATGACTCCTTAAATATGTAAATTATAGTTATACCTTGAAATTTCAATTCAAATGCAGACTAA 
TT AT AGGGAAT T T GGAAGT GT AT C AAT AAAAC AGT AT AT AAT T T T 



FIGURE 110



MPPFLLLTCLFITGTSVSPVALDPCSAYISLNEPWRNTDHQLDESQGPPLCDNHVNGEWYHFTGMAGDAMP
TFCIPENHCGTHAPVWLNGSHPLEGDGIVQRQACASFNGNCCLWNTTVEVKACPGGYYVYRLTKPSVCFHV
YCGHFYDICDEDCHGSCSDTSECTCAPGTVLGPDRQTCFDENECEQNNGGCSEICVNLKNSYRCECGVGRV
LRSDGKTCEDVEGCHNNNGGCSHSCLGSEKGYQCECPRGLVLSEDNHTCQVPVLCKSNAIEVNIPRELVGG
LELFLTNTSCRGVSNGTHVNILFSLKTCGTVVDVVNDKIVASNLVTGLPKQTPGSSGDFIIRTSKLLIPVT
CEFPRLYTISEGYVPNLRNSPLEIMSRNHGIFPFTLEIFKDNEFEEPYREALPTLKLRDSLYFGIEPVVHV
SGLESLVESCFATPTSKIDEVLKYYLIRDGCVSDDSVKQYTSRDHLAKHFQVPVFKFVGKDHKEVFLHCRV
LVCGVLDERSRCAQGCHRRMRRGAGGEDSAGLQGQTLTGGPIRIDWED 



Important features of the protein: 
Signal peptide: 

amino acids 1-16 

N-glycosylation sites. 



amino acids 89-93, 116-120, 259-263, 291-295, 299-303

Tyrosine kinase phosphorylation sites.

amino acids 411-418, 443-451

N-myristoylation sites.

amino acids 226-232, 233-239, 240-246, 252-258, 296-302, 300-306,
522-528, 531-537

Aspartic acid and asparagine hydroxylation site.

amino acids 197-209 



ZP domain proteins. 

amino acids 431-457 



Calcium-binding EGF-like proteins. 

amino acids 191-212, 232-253 
FIGURE 111



GAGAGAGGCAGCAGCTTGCTCAGCGGACAAGGATGCTGGGCGTGAGGGACCAAGGCCTGCCCTGCACTCGG 
GCCTCCTCCAGCCAGTGCTGACCAGGGACTTCTGACCTGCTGGCCAGCCAGGACCTGTGTGGGGAGGCCCT 
CCTGCTGCCTTGGGGTGACAATCTCAGCTCCAGGCTACAGGGAGACCGGGAGGATCACAGAGCCAGC ATGT 
T ACAGGAT CCT G AC AGTGAT CAAC CT CTGAAC AGC C TCG AT G TC AAACCCCT GCGC AAACCC CGT AT C CC C 
AT GG AG ACCT TC AGAAAGGT GGGGATCCCC AT CAT C AT AGC ACT ACTG AGCCTGGC G AGT AT CATC ATTG T 
GGT T GTCCT CAT CAAGGT GATT CT GG AT AAAT ACT ACT TCCT CT GCGGGCAGCC T C TC CACT T C AT CC CG A 
GGAAGCAGCTGTGTGACGGAGAGCTGGACTGTCCCTTGGGGGAGGACGAGGAGCACTGTGTCAAGAGCTTC 
CCCGAAGGGCCTGCAGTGGCAGTCCGCCTCTCCAAGGACCGATCCACACTGCAGGTGCTGGACTCGGCCAC 
AGGGAACTGGTT CTCTGCCT GTTT CGACAACTTCAC AGAAGC TC TCGCTG AGACAGCC TGT AGGC AGATG G 
GCTACAGCAGAGCTGTGGAGATTGGCCCAGACCAGGATCTGGATGTTGTTGAAATCACAGAAAACAGCCAG 
GAGCTTCGCATGCGGAACTCAAGTGGGCCCTGTCTCTCAGGCTCCCTGGTCTCCCTGCACTGTCTTGCCTG 
TGGGAAGAGCCTGAAGACCCCCCGTGTGGTGGGTGGGGAGGAGGCCTCTGTGGATTCTTGGCCTTGGCAGG 
TC AG CATCCAGT AC GACAAACAGC ACGT CTGT GG AG GG AGCATC CTGG ACCCCC AC TGGGT C CT C ACGGC A 
GC CC ACTGCT TC AGGAAAC ATACC GATGT GTT CAAC TGGAAGGT GCGGGCAGGC TC AG AC AAACTGGG CAG 
CT T C C C ATCC CT GGCT GTG GCCAAGATC AT CAT CAT TG AATT C AACCCCATGT ACCCC AAAGACAAT G AC A 
TCGCCCTCATGAAGCTGCAGTTCCCACTCACTTTCTCAGGCACAGTCAGGCCCATCTGTCTGCCCTTCTTT 
GATGAGGAGCTCACTCCAGCCACCCCACTCTGGATCATTGGATGGGGCTTTACGAAGCAGAATGGAGGGAA 
GAT GT C TGACAT ACTGCT GC AGGC GT CAGT CC AGGT C ATTGACAGC AC ACG GT GCAATGC AGACGATGC GT 
ACCAGGGGGAAGTCACCGAGAAGATGATGTGTGCAGGCATCCCGGAAGGGGGTGTGGACACCTGCCAGGGT 
GACAGTGGTGGGCCCCTGATGTACCAATCTGACCAGTGGCATGTGGTGGGCATCGTTAGCTGGGGCTATGG 
CTGC GGGGGC C C GAGC ACCCCAGGAG TATACACC AAGGTCTCAGCCT ATCT C AACTGG AT CT ACAATGTCT 
GGAAGGCTGAGCTG TAAT GCTGCTGCCCCTTTGCAGTGCTGGGAGCCGCTTCCTTCCTGCCCTGCCCACCT 
GGGG AT CCCC CAAAGT CAGACACAGAGC AAGAGT CC CCT TG G GT AC ACCCCTCT GCCC AC AGCCT C AGC AT 
TTCTTGGAGCAGCAAAGGGCCTCAATTCCTGTAAGAGACCCTCGCAGCCCAGAGGCGCCCAGAGGAAGTCA 
GCAGCCCTAGCTCGGCCACACTTGGTGCTCCCAGCATCCCAGGGAGAGACACAGCCCACTGAACAAGGTCT 
CAGGGGTATT GCTAAGCCAAGAAGGAACTTTCCC AC ACT ACT GAATGGAAGCAGGCTGTCTT GT AAAAGCC 
CAGATCACTGTGGGCTGGAGAGGAGAAGGAAAGGGTCTGCGCCAGCCCTGTCCGTCTTCACCCATCCCCAA 
GCCT AC T AGAGC AAGAAAC C AG TT GT AAT AT AAAAT GC ACT GCC CT ACTGT T GGT ATG ACT ACCGT T ACCT 
ACTGTTGTCATTGTTATTACAGCTATGGCCACTATTATTAAAGAGCTGTGTAACATCTCTGGCAAAAAAAA 
AAAA 
FIGURE 112

MLQDPDSDQPLNSLDVKPLRKPRIPMETFRKVGIPIIIALLSLASIIIVVVLIKVILDKYYFLCG
QPLHFIPRKQLCDGELDCPLGEDEEHCVKSFPEGPAVAVRLSKDRSTLQVLDSATGNWFSACFDN
FTEALAETACRQMGYSRAVEIGPDQDLDVVEITENSQELRMRNSSGPCLSGSLVSLHCLACGKSL
KTPRVVGGEEASVDSWPWQVSIQYDKQHVCGGSILDPHWVLTAAHCFRKHTDVFNWKVRAGSDKL
GSFPSLAVAKIIIIEFNPMYPKDNDIALMKLQFPLTFSGTVRPICLPFFDEELTPATPLWIIGWG
FTKQNGGKMSDILLQASVQVIDSTRCNADDAYQGEVTEKMMCAGIPEGGVDTCQGDSGGPLMYQS
DQWHVVGIVSWGYGCGGPSTPGVYTKVSAYLNWIYNVWKAEL

Transmembrane domain: 

amino acids 32-53 (type II)
FIGURE 113

GGCTGGACTGGAACTCCTGGTCCCAAGTGATCCACCCGCCTCAGCCTCCCAAGGTGCTGTGATTA 
TAGG TG T AAGCCACCG T G T CT GGCCT CT GAACAAC T T T T T CAGCAACTAAAAAAGCCACAGGAGT 
TGAACTGCTAGGATTCTGACTATGCTGTGGTGGCTAGTGCTCCTACTCCTACCTACATTAAAATC 
TGTTTTTTGTTCTCTTGTAACTAGCCTTTACCTTCCTAACACAGAGGATCTGTCACTGTGGCTCT 
GGCCCAAACCTGACCTTCACTCTGGAACGAGAACAGAGGTTTCTACCCACACCGTCCCCTCGAAG 
CCGGGGACAGCCTCACCTTGCTGGCCTCTCGCTGGAGCAGTGCCCTCACCAACTGTCTCACGTCT 
GGAGGCACTGACTCGGGCAGTGCAGGTAGCTGAGCCTCTTGGTAGCTGCGGCTTTCAAGGTGGGC 
CTTGCCCTGGCCGTAGAAGGGATTGACAAGCCCGAAGATTTCATAGGCGATGGCTCCCACTGCCC 
AGGCATCAGCCTTGCTGTAGTCAATCACTGCCCTGGGGCCAGGACGGGCCGTGGACACCTGCTCA 
GAAGCAGTGGGTGAGACATCACGCTGCCCGCCCATCTAACCTTTTCATGTCCTGCACATCACCTG 
ATCCATGGGCTAATCTGAACTCTGTCCCAAGGAACCCAGAGCTTGAGTGAGCTGTGGCTCAGACC 
CAGAAGGGGTCTGCTTAGACCACCTGGTTTATGTGACAGGACTTGCATTCTCCTGGAACATGAGG 
GAACGCCGGAGGAAAGCAAAGTGGCAGGGAAGGAACTTGTGCCAAATTATGGGTCAGAAAAGATG 
GAGGTGTTGGGTTATCACAAGGCATCGAGTCTCCTGCATTCAGTGGACATGTGGGGGAAGGGCTG 
CCGATGGCGCATGACACACTCGGGACTCACCTCTGGGGCCATCAGACAGCCGTTTCCGCCCCGAT 
CCACGTACCAGCTGCTGAAGGGCAACTGCAGGCCGATGCTCTCATCAGCCAGGCAGCAGCCAAAA 
TCTGCGATCACCAGCCAGGGGCAGCCGTCTGGGAAGGAGCAAGCAAAGTGACCATTTCTCCTCCC 
CTCCTTCCCTCTGAGAGGCCCTCCTATGTCCCTACTAAAGCCACCAGCAAGACATAGCTGACAGG 
GGCTAAT GGCT CAGT GT T GGCC CAGGAGG T CAGCAAGGC C T GAGAGCT GAT CAGAAGGGCC T GCT 
GTGCGAACACGGAAATGCCTCCAGTAAGCACAGGCTGCAAAATCCCCAGGCAAAGGACTGTGTGG 
C TCAAT T T AAAT CAT G T TC TAGT AAT T GG AG C TG T CC CC AAGAC CAAAGG AG CT AGAGC T T GG T T 
CAAAT GAT C T CC AAGGGC C C T TAT AC CCC AGGAGAC T T T GAT T T GAAT T T GAAAC C CC AAAT C CA 
AACCTAAGAACCAGGTGCATTAAGAATCAGTTATTGCCGGGTGTGGTGGCCTGTAATGCCAACAT 
TTTGGGAGGCCGAGGCGGGTAGATCACCTGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGG 
TGAAACCCCTGTCTCTACTAAAAATACAAAAAAACTAGCCAGGCATGGTGGTGTGTGCCTGTATC 
C CAG C T AC T CGG GAG GCT GAGACAG G AGAAT T AC T T G AACC T G G GAGG T GAAG GAG GCT GAGACA 
G G AGAAT CAC T T CAGC C T GAG C AACACAG C GAG AC TCTGTCTC AGAAAAAAT AAAAAAAGAAT T A 
TGGTTATTTGTAA 
FIGURE 114

MLWWLVLLLLPTLKSVFCSLVTSLYLPNTEDLSLWLWPKPDLHSGTRTEVSTHTVPSKPGTASPC
WPLAGAVPSPTVSRLEALTRAVQVAEPLGSCGFQGGPCPGRRRD

Signal peptide: 

amino acids 1-15 
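
The feature ranges in these listings are 1-based and inclusive, so "amino acids 1-15" covers the first fifteen residues. The following is a minimal Python sketch of how such a range maps onto a string slice, assuming only the first 30 residues of the sequence listed just above. The helper names `parse_ranges` and `residues` are hypothetical and are not part of the original.

```python
import re
from typing import List, Tuple


def parse_ranges(text: str) -> List[Tuple[int, int]]:
    """Pull every 'start-end' pair out of a feature line such as
    'amino acids 12-35, 57-86'."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\s*-\s*(\d+)", text)]


def residues(sequence: str, start: int, end: int) -> str:
    """Return residues start..end, both 1-based and inclusive."""
    if not 1 <= start <= end <= len(sequence):
        raise ValueError(f"range {start}-{end} is outside the sequence")
    return sequence[start - 1:end]


# First 30 residues of the amino acid sequence listed above.
PROTEIN_START = "MLWWLVLLLLPTLKSVFCSLVTSLYLPNTE"

if __name__ == "__main__":
    for start, end in parse_ranges("amino acids 1-15"):
        print(start, end, residues(PROTEIN_START, start, end))
```

Subtracting one from the start and leaving the end unchanged is the only conversion needed, because Python slices exclude their upper bound.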
FIGURE 115



CAGCAGTGGTCTCTCAGTCCTCTCAAAGCAAGGAAAGAGTACTGTGTGCTGAGAGACCATGGCAA
AGAATCCTCCAGAGAATTGTGAAGACTGTCACATTCTAAATGCAGAAGCTTTTAAATCCAAGAAA
ATATGTAAATCACTTAAGATTTGTGGACTGGTGTTTGGTATCCTGGCCCTAACTCTAATTGTCCT 
GTTTTGGGGGAGCAAGCACTTCTGGCCGGAGGTACCCAAAAAAGCCTATGACATGGAGCACACTT 
TCTACAGCAATGGAGAGAAGAAGAAGATTTACATGGAAATTGATCCTGTGACCAGAACTGAAATA 
TTCAGAAGCGGAAATGGCACTGATGAAACATTGGAAGTGCACGACTTTAAAAACGGATACACTGG
CATCTACTTCGTGGGTCTTCAAAAATGTTTTATCAAAACTCAGATTAAAGTGATTCCTGAATTTT
CTGAACCAGAAGAGGAAATAGATGAGAATGAAGAAATTACCACAACTTTCTTTGAACAGTCAGTG
ATTTGGGTCCCAGCAGAAAAGCCTATTGAAAACCGAGATTTTCTTAAAAATTCCAAAATTCTGGA
GATTTGTGATAACGTGACCATGTATTGGATCAATCCCACTCTAATATCAGTTTCTGAGTTACAAG 
ACTTTGAGGAGGAGGGAGAAGATCTTCACTTTCCTGCCAACGAAAAAAAAGGGATTGAACAAAAT 
GAACAGT GGG T GGT CCCT CAAGTGAAAGTAGAGAAGACC CG TC ACGCCAGACAAGCAAGT GAGGA 
AGAACTTCCAATAAATGACTATACTGAAAATGGAATAGAATTTGATCCCATGCTGGATGAGAGAG 
GTTATTGTTGTATTTACTGCCGTCGAGGCAACCGCTATTGCCGCCGCGTCTGTGAACCTTTACTA 
GGCTACTACCCATATCCATACTGCTACCAAGGAGGACGAGTCATCTGTCGTGTCATCATGCCTTG 
TAACTGGTGGGTGGCCCGCATGCTGGGGAGGGTCTAATAGGAGGTTTGAGCTCAAATGCTTAAAC 
TGCTGGCAACATATAATAAATGCATGCTATTCAATGAATTTCTGCCTATGAGGCATCTGGCCCCT 
GGTAGCCAGCTCTCCAGAATTACTTGTAGGTAATTCCTCTCTTCATGTTCTAATAAACTTCTACA 
T TAT C AC CAAAAAAAAAAAAAAAAAAA 
FIGURE 116



MAKNPPENCEDCHILNASAFKSKKICKSLKICGLVFGILALTLIVLFWGSKHFWPEVPKKAYDME
HTFYSNGEKKKIYMEIDPVTRTEIFRSGNGTDETLEVHDFKNGYTGIYFVGLQKCFIKTQIKVIP
EFSEPEEEIDENEEITTTFFEQSVIWVPAEKPIENRDFLKNSKILEICDNVTMYWINPTLISVSE
LQDFEEEGEDLHFPANEKKGIEQNEQWVVPQVKVEKTRHARQASEEELPINDYTENGIEFDPMLD
ERGYCCIYCRRGNRYCRRVCEPLLGYYPYPYCYQGGRVICRVIMPCNWWVARMLGRV

Important features of the protein: 
Signal peptide: 

amino acids 1-40 



Transmembrane domain: 

amino acids 25-47 (type II)



N-glycosylation sites. 

amino acids 94-97, 180-183 

Glycosaminoglycan attachment sites. 

amino acids 92-95, 70-73, 85-88, 133-136, 148-151, 192-195, 239-242



N-myristoylation sites. 

amino acids 33-38, 95-100, 116-121, 215-220, 272-277 

Microbodies C-terminal targeting signal. 

amino acids 315-317 



Cytochrome c family heme-binding site signature. 

amino acids 9-14 
FIGURE 117



GAGCTCCCCTCAGGAGCGCGTTAGCTTCACACCTTCGGCAGCAGGAGGGCGGCAGCTTCTCGCAGGCGGCA 
GGGCGGGCGGCCAGGATCATGTCCACCACCACATGCCAAGTGGTGGCGTTCCTCCTGTCCATCCTGGGGCT 
GGCCGGCTGCATCGCGGCCACCGGGATGGACATGTGGAGCACCCAGGACCTGTACGACAACCCCGTCACCT 
CCGTGTTCCAGTACGAAGGGCTCTGGAGGAGCTGCGTGAGGCAGAGTTCAGGCTTCACCGAATGCAGGCCC 
TATTTCACCATCCTGGGACTTCCAGCCATGCTGCAGGCAGTGCGAGCCCTGATGATCGTAGGCATCGTCCT 
GGGTGCCATTGGCCTCCTGGTATCCATCTTTGCCCTGAAATGCATCCGCATTGGCAGCATGGAGGACTCTG 
CCAAAGCCAACATGACACTGACCTCCGGGATCATGTTCATTGTCTCAGGTCTTTGTGCAATTGCTGGAGTG
TCTGTGTTTGCCAACATGCTGGTGACTAACTTCTGGATGTCCACAGCTAACATGTACACCGGCATGGGTGG
GATGGTGCAGACTGTTCAGACCAGGTACACATTTGGTGCGGCTCTGTTCGTGGGCTGGGTCGCTGGAGGCC
TCACACTAATTGGGGGTGTGATGATGTGCATCGCCTGCCGGGGCCTGGCACCAGAAGAAACCAACTACAAA
GCCGTTTCTTATCATGCCTCAGGCCACAGTGTTGCCTACAAGCCTGGAGGCTTCAAGGCCAGCACTGGCTT
TGGGTCCAACACCAAAAACAAGAAGATATACGATGGAGGTGCCCGCACAGAGGACGAGGTACAATCTTATC
CTTCCAAGCACGACTATGTGTAATGCTCTAAGACCTCTCAGCACGGGCGGAAGAAACTCCCGGAGAGCTCA
CCCAAAAAACAAGGAGATCCCATCTAGATTTCTTCTTGCTTTTGACTCACAGCTGGAAGTTAGAAAAGCCT
CGATTTCATCTTTGGAGAGGCCAAATGGTCTTAGCCTCAGTCTCTGTCTCTAAATATTCCACCATAAAACA 
GCTGAGTTATTTATGAATTAGAGGCTATAGCTCACATTTTCAATCCTCTATTTCTTTTTTTAAATATAACT 
TTCTACTCTGATGAGAGAATGTGGTTTTAATCTCTCTCTCACATTTTGATGATTTAGACAGACTCCCCCTC 
TTCCTCCTAGTCAATAAACCCATTGATGATCTATTTCCCAGCTTATCCCCAAGAAAACTTTTGAAAGGAAA 
GAGTAGACCCAAAGATGTTATTTTCTGCTGTTTGAATTTTGTCTCCCCACCCCCAACTTGGCTAGTAATAA 
ACACTTACTGAAGAAGAAGCAATAAGAGAAAGATATTTGTAATCTCTCCAGCCCATGATCTCGGTTTTCTT 
ACACTGTGATCTTAAAAGTTACCAAACCAAAGTCATTTTCAGTTTGAGGCAACCAAACCTTTCTACTGCTG 
TTGACATCTTCTTATTACAGCAACACCATTCTAGGAGTTTCCTGAGCTCTCCACTGGAGTCCTCTTTCTGT 
CGCGGGTCAGAAATTGTCCCTAGATGAATGAGAAAATTATTTTTTTTAATTTAAGTCCTAAATATAGTTAA
AATAAATAATGTTTTAGTAAAATGATACACTATCTCTGTGAAATAGCCTCACCCCTACATGTGGATAGAAG
GAAATGAAAAAATAATTGCTTTGACATTGTCTATATGGTACTTTGTAAAGTCATGCTTAAGTACAAATTCC 
ATGAAAAGCTCACACCTGTAATCCTAGCACTTTGGGAGGCTGAGGAGGAAGGATCACTTGAGCCCAGAAGT 
TCGAGACTAGCCTGGGCAACATGGAGAAGCCCTGTCTCTACAAAATACAGAGAGAAAAAATCAGCCAGTCA 
TGGTGGCATACACCTGTAGTCCCAGCATTCCGGGAGGCTGAGGTGGGAGGATCACTTGAGCCCAGGGAGGT 
TGGGGCTGCAGTGAGCCATGATCACACCACTGCACTCCAGCCAGGTGACATAGCGAGATCCTGTCTAAAAA
AATAAAAAATAAATAATGGAACACAGCAAGTCCTAGGAAGTAGGTTAAAACTAATTCTTTAA 
FIGURE 118



MSTTTCQVVAFLLSILGLAGCIAATGMDMWSTQDLYDNPVTSVFQYEGLWRSCVRQSSGFTECRP
YFTILGLPAMLQAVRALMIVGIVLGAIGLLVSIFALKCIRIGSMEDSAKANMTLTSGIMFIVSGL
CAIAGVSVFANMLVTNFWMSTANMYTGMGGMVQTVQTRYTFGAALFVGWVAGGLTLIGGVMMCIA
CRGLAPEETNYKAVSYHASGHSVAYKPGGFKASTGFGSNTKNKKIYDGGARTEDEVQSYPSKHDY
V 

Signal peptide:

amino acids 1-23

Transmembrane domains:

amino acids 81-100, 121-141, 173-194
FIGURE 119



GGAAAAACTGTTCTCTTCTGTGGCACAGAGAACCCTGCTTCAAAGCAGAAGTAGCAGTTCCGGAGTCC 
AGCTGGCTAAAACTCATCCCAGAGGATAATGGCAACCCATGCCTTAGAAATCGCTGGGCTGTTTCTTG 
GTGGTGTTGGAATGGTGGGCACAGTGGCTGTCACTGTCATGCCTCAGTGGAGAGTGTCGGCCTTCATT 
GAAAACAACATCGTGGTTTTTGAAAACTTCTGGGAAGGACTGTGGATGAATTGCGTGAGGCAGGCTAA
CATCAGGATGCAGTGCAAAATCTATGATTCCCTGCTGGCTCTTTCTCCGGACCTACAGGCAGCCAGAG 
GACTGATGTGTGCTGCTTCCGTGATGTCCTTCTTGGCTTTCATGATGGCCATCCTTGGCATGAAATGC 
ACCAGGTGCACGGGGGACAATGAGAAGGTGAAGGCTCACATTCTGCTGACGGCTGGAATCATCTTCAT
CATCACGGGCATGGTGGTGCTCATCCCTGTGAGCTGGGTTGCCAATGCCATCATCAGAGATTTCTATA
ACTCAATAGTGAATGTTGCCCAAAAACGTGAGCTTGGAGAAGCTCTCTACTTAGGATGGACCACGGCA
CTGGTGCTGATTGTTGGAGGAGCTCTGTTCTGCTGCGTTTTTTGTTGCAACGAAAAGAGCAGTAGCTA
CAGATACTCGATACCTTCCCATCGCACAACCCAAAAAAGTTATCACACCGGAAAGAAGTCACCGAGCG
TCTACTCCAGAAGTCAGTATGTGTAGTTGTGTATGTTTTTTTAACTTTACTATAAAGCCATGCAAATG
ACAAAAATCTATATTACTTTCTCAAAATGGACCCCAAAGAAACTTTGATTTACTGTTCTTAACTGCCT 
AATCTTAATTACAGGAACTGTGCATCAGCTATTTATGATTCTATAAGCTATTTCAGCAGAATGAGATA 
TTAAACCCAATGCTTTGATTGTTCTAGAAAGTATAGTAATTTGTTTTCTAAGGTGGTTCAAGCATCTA 
CTCTTTTTATCATTTACTTCAAAATGACATTGCTAAAGACTGCATTATTTTACTACTGTAATTTCTCC
ACGACATAGCATTATGTACATAGATGAGTGTAACATTTATATCTCACATAGAGACATGCTTATATGGT
TTTATTTAAAATGAAATGCCAGTCCATTACACTGAATAAATAGAACTCAACTATTGCTTTTCAGGGAA
ATCATGGATAGGGTTGAAGAAGGTTACTATTAATTGTTTAAAAACAGCTTAGGGATTAATGTCCTCCA
TTTATAATGAAGATTAAAATGAAGGCTTTAATCAGCATTGTAAAGGAAATTGAATGGCTTTCTGATAT
GCTGTTTTTTAGCCTAGGAGTTAGAAATCCTAACTTCTTTATCCTCTTCTCCCAGAGGCTTTTTTTTT 
CTTGTGTATTAAATTAACATTTTTAAAACGCAGATATTTTGTCAAGGGGCTTTGCATTCAAACTGCTT 
TTCCAGGGCTATACTCAGAAGAAAGATAAAAGTGTGATCTAAGAAAAAGTGATGGTTTTAGGAAAGTG
AAAATATTTTTGTTTTTGTATTTGAAGAAGAATGATGCATTTTGACAAGAAATCATATATGTATGGAT
ATATTTTAATAAGTATTTGAGTACAGACTTTGAGGTTTCATCAATATAAATAAAAGAGCAGAAAAATA
TGTCTTGGTTTTCATTTGCTTACCAAAAAAACAACAACAAAAAAAGTTGTCCTTTGAGAACTTCACCT
GCTCCTATGTGGGTACCTGAGTCAAAATTGTCATTTTTGTTCTGTGAAAAATAAATTTCCTTCTTGTA 
CCATTTCTGTTTAGTTTTACTAAAATCTGTAAATACTGTATTTTTCTGTTTATTCCAAATTTGATGAA 
ACTGACAATCCAATTTGAAAGTTTGTGTCGACGTCTGTCTAGCTTAAATGAATGTGTTCTATTTGCTT 
TATACATTTATATTAATAAATTGTACATTTTTCTAATT
FIGURE 120



MATHALEIAGLFLGGVGMVGTVAVTVMPQWRVSAFIENNIVVFENFWEGLWMNCVRQANIRMQCK
IYDSLLALSPDLQAARGLMCAASVMSFLAFMMAILGMKCTRCTGDNEKVKAHILLTAGIIFIITG
MVVLIPVSWVANAIIRDFYNSIVNVAQKRELGEALYLGWTTALVLIVGGALFCCVFCCNEKSSSY
RYSIPSHRTTQKSYHTGKKSPSVYSRSQYV

Signal peptide: 
amino acids 1-17 

Transmembrane domains: 

amino acids 82-101, 118-145, 164-188 
FIGURE 121



GGAGAGAGGCGCGCGGGTGAAAGGCGCATTGATGCAGCCTGCGGCGGCCTCGGAGCGCGGCGGAG 
CCAGACGCTGACCACGTTCCTCTCCTCGGTCTCCTCCGCCTCCAGCTCCGCGCTGCCCGGCAGCC 
GGGAGCCATGCGACCCCAGGGCCCCGCCGCCTCCCCGCAGCGGCTCCGCGGCCTCCTGCTGCTCC 
TGCTGCTGCAGCTGCCCGCGCCGTCGAGCGCCTCTGAGATCCCCAAGGGGAAGCAAAAGGCGCAG 
CTCCGGCAGAGGGAGGTGGTGGACCTGTATAATGGAATGTGCTTACAAGGGCCAGCAGGAGTGCC 
TGGTCGAGACGGGAGCCCTGGGGCCAATGTTATTCCGGGTACACCTGGGATCCCAGGTCGGGATG
GATTCAAAGGAGAAAAGGGGGAATGTCTGAGGGAAAGCTTTGAGGAGTCCTGGACACCCAACTAC
AAGCAGTGTTCATGGAGTTCATTGAATTATGGCATAGATCTTGGGAAAATTGCGGAGTGTACATT
TACAAAGATGCGTTCAAATAGTGCTCTAAGAGTTTTGTTCAGTGGCTCACTTCGGCTAAAATGCA
GAAATGCATGCTGTCAGCGTTGGTATTTCACATTCAATGGAGCTGAATGTTCAGGACCTCTTCCC
ATTGAAGCTATAATTTATTTGGACCAAGGAAGCCCTGAAATGAATTCAACAATTAATATTCATCG
CACTTCTTCTGTGGAAGGACTTTGTGAAGGAATTGGTGCTGGATTAGTGGATGTTGCTATCTGGG
TTGGCACTTGTTCAGATTACCCAAAAGGAGATGCTTCTACTGGATGGAATTCAGTTTCTCGCATC
ATTATTGAAGAACTACCAAAATAAATGCTTTAATTTTCATTTGCTACCTCTTTTTTTATTATGCC
TTGGAATGGTTCACTTAAATGACATTTTAAATAAGTTTATGTATACATCTGAATGAAAAGCAAAG
CTAAATATGTTTACAGACCAAAGTGTGATTTCACACTGTTTTTAAATCTAGCATTATTCATTTTG
CTTCAATCAAAAGTGGTTTCAATATTTTTTTTAGTTGGTTAGAATACTTTCTTCATAGTCACATT
CTCTCAACCTATAATTTGGAATATTGTTGTGGTCTTTTGTTTTTTCTCTTAGTATAGCATTTTTA
AAAAAATATAAAAGCTACCAATCTTTGTACAATTTGTAAATGTTAAGAATTTTTTTTATATCTGT
TAAATAAAAATTATTTCCAACA
FIGURE 122

MRPQGPAASPQRLRGLLLLLLLQLPAPSSASEIPKGKQKAQLRQREVVDLYNGMCLQGPAGVPGR
DGSPGANVIPGTPGIPGRDGFKGEKGECLRESFEESWTPNYKQCSWSSLNYGIDLGKIAECTFTK
MRSNSALRVLFSGSLRLKCRNACCQRWYFTFNGAECSGPLPIEAIIYLDQGSPEMNSTINIHRTS
SVEGLCEGIGAGLVDVAIWVGTCSDYPKGDASTGWNSVSRIIIEELPK

Signal peptide: 

amino acids 1-30 

Transmembrane domain: 

amino acids 195-217 
FIGURE 123



GCTGAGCGTGTGCGCGGTACGGGGCTCTCCTGCCTTCTGGGCTCCAACGCAGCTCTGTGGCTGAA 
CTGGGTGCTCATCACGGGAACTGCTGGGCTATGGAATACAGATGTGGCAGCTCAGGTAGCCCCAA 
ATTGCCTGGAAGAATACATCATGTTTTTCGATAAGAAGAAATTGTAGGATCCAGTTTTTTTTTTTA
ACCGCCCCCTCCCCACCCCCCAAAAAAACTGTAAAGATGCAAAAACGTAATATCCATGAAGATGC 
TATTACCTAGGAAGATTTTGATGTTTTGCTGCGAATGCGGTGTTGGGATTTATTTGTTCTTGGAG
TGTTCTGCGTGGCTGGCAAAGAATAATGTTCCAAAATCGGTCCATCTCCCAAGGGGTCCAATTTT 
TCTTCCTGGGTGTCAGCGAGCCCTGACTCACTACAGTGCAGCTGACAGGGGCTGTCATGCAACTG 
GCCCCTAAGCCAAAGCAAAAGACCTAAGGACGACCTTTGAACAATACAAAGGATGGGTTTCAATG
TAATTAGGCTACTGAGCGGATCAGCTGTAGCACTGGTTATAGCCCCCACTGTCTTACTGACAATG 
CTTTCTTCTGCCGAACGAGGATGCCCTAAGGGCTGTAGGTGTGAAGGCAAAATGGTATATTGTGA 
ATCTCAGAAATTACAGGAGATACCCTCAAGTATATCTGCTGGTTGCTTAGGTTTGTCCCTTCGCT
ATAACAGCCTTCAAAAACTTAAGTATAATCAATTTAAAGGGCTCAACCAGCTCACCTGGCTATAC
CTTGACCATAACCATATCAGCAATATTGACGAAAATGCTTTTAATGGAATACGCAGACTCAAAGA
GCTGATTCTTAGTTCCAATAGAATCTCCTATTTTCTTAACAATACCTTCAGACCTGTGACAAATT
TACGGAACTTGGATCTGTCCTATAATCAGCTGCATTCTCTGGGATCTGAACAGTTTCGGGGCTTG
CGGAAGCTGCTGAGTTTACATTTACGGTCTAACTCCCTGAGAACCATCCCTGTGCGAATATTCCA
AGACTGCCGCAACCTGGAACTTTTGGACCTGGGATATAACCGGATCCGAAGTTTAGCCAGGAATG
TCTTTGCTGGCATGATCAGACTCAAAGAACTTCACCTGGAGCACAATCAATTTTCCAAGCTCAAC
CTGGCCCTTTTTCCAAGGTTGGTCAGCCTTCAGAACCTTTACTTGCAGTGGAATAAAATCAGTGT
CATAGGACAGACCATGTCCTGGACCTGGAGCTCCTTACAAAGGCTTGATTTATCAGGCAATGAGA
TCGAAGCTTTCAGTGGACCCAGTGTTTTCCAGTGTGTCCCGAATCTGCAGCGCCTCAACCTGGAT
TCCAACAAGCTCACATTTATTGGTCAAGAGATTTTGGATTCTTGGATATCCCTCAATGACATCAG
TCTTGCTGGGAATATATGGGAATGCAGCAGAAATATTTGCTCCCTTGTAAACTGGCTGAAAAGTT
TTAAAGGTCTAAGGGAGAATACAATTATCTGTGCCAGTCCCAAAGAGCTGCAAGGAGTAAATGTG
ATCGATGCAGTGAAGAACTACAGCATCTGTGGCAAAAGTACTACAGAGAGGTTTGATCTGGCCAG
GGCTCTCCCAAAGCCGACGTTTAAGCCCAAGCTCCCCAGGCCGAAGCATGAGAGCAAACCCCCTT
TGCCCCCGACGGTGGGAGCCACAGAGCCCGGCCCAGAGACCGATGCTGACGCCGAGCACATCTCT
TTCCATAAAATCATCGCGGGCAGCGTGGCGCTTTTCCTGTCCGTGCTCGTCATCCTGCTGGTTAT
CTACGTGTCATGGAAGCGGTACCCTGCGAGCATGAAGCAGCTGCAGCAGCGCTCCCTCATGCGAA
GGCACAGGAAAAAGAAAAGACAGTCCCTAAAGCAAATGACTCCCAGCACCCAGGAATTTTATGTA
GATTATAAACCCACCAACACGGAGACCAGCGAGATGCTGCTGAATGGGACGGGACCCTGCACCTA 
TAACAAATCGGGCTCCAGGGAGTGTGAGGTATGAACCATTGTGATAAAAAGAGCTCTTAAAAGCT 
GGGAAATAAGTGGTGCTTTATTGAACTCTGGTGACTATCAAGGGAACGCGATGCCCCCCCTCCCC 
TTCCCTCTCCCTCTCACTTTGGTGGCAAGATCCTTCCTTGTCCGTTTTAGTGCATTCATAATACT 
GGTCATTTTCCTCTCATACATAATCAACCCATTGAAATTTAAATACCACAATCAATGTGAAGCTT
GAACTCCGGTTTAATATAATACCTATTGTATAAGACCCTTTACTGATTCCATTAATGTCGCATTT
GTTTTAAGATAAAACTTCTTTCATAGGTAAAAAAAAAAA
FIGURE 124

MGFNVIRLLSGSAVALVIAPTVLLTMLSSAERGCPKGCRCEGKMVYCESQKLQEIPSSISAGCLG 
LSLRYNSLQKLKYNQFKGLNQLTWLYLDHNHISNIDENAFNGIRRLKELILSSNRISYFLNNTFR 
PVTNLRNLDLSYNQLHSLGSEQFRGLRKLLSLHLRSNSLRTIPVRIFQDCRNLELLDLGYNRIRS 
LARNVFAGMIRLKELHLEHNQFSKLNLALFPRLVSLQNLYLQWNKISVIGQTMSWTWSSLQRLDL 
SGNEIEAFSGPSVFQCVPNLQRLNLDSNKLTFIGQEILDSWISLNDISLAGNIWECSRNICSLVN
WLKSFKGLRENTIICASPKELQGVNVIDAVKNYSICGKSTTERFDLARALPKPTFKPKLPRPKHE 
SKPPLPPTVGATEPGPETDADAEHISFHKIIAGSVALFLSVLVILLVIYVSWKRYPASMKQLQQR 
SLMRRHRKKKRQSLKQMTPSTQEFYVDYKPTNTETSEMLLNGTGPCTYNKSGSRECEV

Important features of the protein: 
Signal peptide: 

amino acids 1-33 

Transmembrane domain: 

amino acids 420-442 

N-glycosylation sites. 

amino acids 126-129, 357-360, 496-499, 504-507 

cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 465-468 

Tyrosine kinase phosphorylation site. 

amino acids 136-142 

N-myristoylation sites. 

amino acids 11-16, 33-38, 245-250, 332-337, 497-502, 507-512 
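
The N-glycosylation positions listed for FIGURE 124 can be spot-checked with a short motif scan. The sketch below is an illustration only: it assumes the PROSITE-style consensus N-{P}-[ST]-{P} (the figure does not state which tool or pattern produced its annotations), and the function name and the fragment offset are hypothetical helpers introduced here. The fragment is residues 121-130 of the FIGURE 124 sequence.

```python
import re

# Assumed consensus for N-glycosylation: Asn, any residue except Pro,
# Ser or Thr, any residue except Pro. The lookahead makes the scan
# report overlapping candidates as well.
N_GLYC = re.compile(r"(?=(N[^P][ST][^P]))")


def find_n_glycosylation_sites(protein):
    """Return 1-based inclusive (start, end) ranges of candidate sites."""
    return [(m.start() + 1, m.start() + 4) for m in N_GLYC.finditer(protein)]


# Residues 121-130 of the FIGURE 124 protein (hypothetical test fragment).
fragment = "ISYFLNNTFR"
offset = 120  # shifts fragment coordinates back to full-sequence numbering
sites = [(s + offset, e + offset) for s, e in find_n_glycosylation_sites(fragment)]
print(sites)  # [(126, 129)]
```

The reported range agrees with the first N-glycosylation entry listed above (amino acids 126-129); running the same function over a full, cleanly transcribed sequence would be the natural next check.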
FIGURE 125



CCGTTATCGTCTTGCGCTACTGCTGAATGTCCGTCCCGGAGGAGGAGGAGAGGCTTTTGCCGCTG
ACCCAGAGATGGCCCCGAGCGAGCAAATTCCTACTGTCCGGCTGCGCGGCTACCGTGGCCGAGCT 
AGCAACCTTTCCCCTGGATCTCACAAAAACTCGACTCCAAATGCAAGGAGAAGCAGCTCTTGCTC
GGTTGGGAGACGGTGCAAGAGAATCTGCCCCCTATAGGGGAATGGTGCGCACAGCCCTAGGGATC 
ATTGAAGAGGAAGGCTTTCTAAAGCTTTGGCAAGGAGTGACACCCGCCATTTACAGACACGTAGT 
GTATTCTGGAGGTCGAATGGTCACATATGAACATCTCCGAGAGGTTGTGTTTGGCAAAAGTGAAG 
ATGAGCATTATCCCCTTTGGAAATCAGTCATTGGAGGGATGATGGCTGGTGTTATTGGCCAGTTT 
TTAGCCAATCCAACTGACCTAGTGAAGGTTCAGATGCAAATGGAAGGAAAAAGGAAACTGGAAGG
AAAACCATTGCGATTTCGTGGTGTACATCATGCATTTGCAAAAATCTTAGCTGAAGGAGGAATAC
GAGGGCTTTGGGCAGGCTGGGTACCCAATATACAAAGAGCAGCACTGGTGAATATGGGAGATTTA
ACCACTTATGATACAGTGAAACACTACTTGGTATTGAATACACCACTTGAGGACAATATCATGAC
TCACGGTTTATCAAGTTTATGTTCTGGACTGGTAGCTTCTATTCTGGGAACACCAGCCGATGTCA
TCAAAAGCAGAATAATGAATCAACCACGAGATAAACAAGGAAGGGGACTTTTGTATAAATCATCG
ACTGACTGCTTGATTCAGGCTGTTCAAGGTGAAGGATTCATGAGTCTATATAAAGGCTTTTTACC
ATCTTGGCTGAGAATGACCCCTTGGTCAATGGTGTTCTGGCTTACTTATGAAAAAATCAGAGAGA
TGAGTGGAGTCAGTCCATTTTAA
FIGURE 126



MSVPEEEERLLPLTQRWPRASKFLLSGCAATVAELATFPLDLTKTRLQMQGEAALARLGDGARES 
APYRGMVRTALGIIEEEGFLKLWQGVTPAIYRHVVYSGGRMVTYEHLREVVFGKSEDEHYPLWKS
VIGGMMAGVIGQFLANPTDLVKVQMQMEGKRKLEGKPLRFRGVHHAFAKILAEGGIRGLWAGWVP
NIQRAALVNMGDLTTYDTVKHYLVLNTPLEDNIMTHGLSSLCSGLVASILGTPADVIKSRIMNQP
RDKQGRGLLYKSSTDCLIQAVQGEGFMSLYKGFLPSWLRMTPWSMVFWLTYEKIREMSGVSPF 

Transmembrane domains: 

amino acids 25-38, 130-147, 233-248 
FIGURE 127



CGCGGATCGGACCCAAGCAGGTCGGCGGCGGCGGCAGGAGAGCGGCCGGGCGTCAGCTCCTCGAC 
CCCCGTGTCGGGCTAGTCCAGCGAGGCGGACGGGCGGCGTGGGCCCATGGCCAGGCCCGGCATGG 
AGCGGTGGCGCGACCGGCTGGCGCTGGTGACGGGGGCCTCGGGGGGCATCGGCGCGGCCGTGGCC 
CGGGCCCTGGTCCAGCAGGGACTGAAGGTGGTGGGCTGCGCCCGCACTGTGGGCAACATCGAGGA 
GCTGGCTGCTGAATGTAAGAGTGCAGGCTACCCCGGGACTTTGATCCCCTACAGATGTGACCTAT 
CAAATGAAGAGGACATCCTCTCCATGTTCTCAGCTATCCGTTCTCAGCACAGCGGTGTAGACATC
TGCATCAACAATGCTGGCTTGGCCCGGCCTGACACCCTGCTCTCAGGCAGCACCAGTGGTTGGAA
GGACATGTTCAATGTGAACGTGCTGGCCCTCAGCATCTGCACACGGGAAGCCTACCAGTCCATGA
AGGAGCGGAATGTGGACGATGGGCACATCATTAACATCAATAGCATGTCTGGCCACCGAGTGTTA
CCCCTGTCTGTGACCCACTTCTATAGTGCCACCAAGTATGCCGTCACTGCGCTGACAGAGGGACT
GAGGCAAGAGCTTCGGGAGGCCCAGACCCACATCCGAGCCACGTGCATCTCTCCAGGTGTGGTGG
AGACACAATTCGCCTTCAAACTCCACGACAAGGACCCTGAGAAGGCAGCTGCCACCTATGAGCAA
ATGAAGTGTCTCAAACCCGAGGATGTGGCCGAGGCTGTTATCTACGTCCTCAGCACCCCCGCACA
CATCCAGATTGGAGACATCCAGATGAGGCCCACGGAGCAGGTGACCTAGTGACTGTGGGAGCTCC
TCCTTCCCTCCCCACCCTTCATGGCTTGCCTCCTGCCTCTGGATTTTAGGTGTTGATTTCTGGAT
CACGGGATACCACTTCCTGTCCACACCCCGACCAGGGGCTAGAAAATTTGTTTGAGATTTTTATA
TCATCTTGTCAAATTGCTTCAGTTGTAAATGTGAAAAATGGGCTGGGGAAAGGAGGTGGTGTCCC
TAATTGTTTTACTTGTTAACTTGTTCTTGTGCCCCTGGGCACTTGGCCTTTGTCTGCTCTCAGTG
TCTTCCCTTTGACATGGGAAAGGAGTTGTGGCCAAAATCCCCATCTTCTTGCACCTCAACGTCTG



TGGCTCAGGGCTGGGGTGGCAGAGGGAGGCCTTCACCTTATATCTGTGTTGTTATCCAGGGCTCC 
AGACTTCCTCCTCTGCCTGCCCCACTGCACCCTCTCCCCCTTATCTATCTCCTTCTCGGCTCCCC 
AGCCCAGTCTTGGCTTCTTGTCCCCTCCTGGGGTCATCCCTCCACTCTGACTCTGACTATGGCAG 
CAGAACACCAGGGCCTGGCCCAGTGGATTTCATGGTGATCATTAAAAAAGAAAAATCGCAACCAA
AAAAAAAAAA 
FIGURE 128

MARPGMERWRDRLALVTGASGGIGAAVARALVQQGLKVVGCARTVGNIEELAAECKSAGYPGTLI
PYRCDLSNEEDILSMFSAIRSQHSGVDICINNAGLARPDTLLSGSTSGWKDMFNVNVLALSICTR
EAYQSMKERNVDDGHIININSMSGHRVLPLSVTHFYSATKYAVTALTEGLRQELREAQTHIRATC
ISPGVVETQFAFKLHDKDPEKAAATYEQMKCLKPEDVAEAVIYVLSTPAHIQIGDIQMRPTEQVT

Important features of the protein: 
Signal peptide: 

amino acids 1-17 

N-myristoylation sites. 

amino acids 18-24, 21-27, 22-28, 24-30, 40-46, 90-96, 109-115, 
199-205 

Short-chain alcohol dehydrogenase.

amino acids 30-42, 104-114 
FIGURE 129



AACTTCTACATGGGCCTCCTGCTGCTGGTGCTCTTCCTCAGCCTCCTGCCGGTGGCCTACACCAT
CATGTCCCTCCCACCCTCCTTTGACTGCGGGCCGTTCAGGTGCAGAGTCTCAGTTGCCCGGGAGC 
ACCTCCCCTCCCGAGGCAGTCTGCTCAGAGGGCCTCGGCCCAGAATTCCAGTTCTGGTTTCATGC 
CAGCCTGTAAAAGGCCATGGAACTTTGGGTGAATCACCGATGCCATTTAAGAGGGTTTTCTGCCA 
GGATGGAAATGTTAGGTCGTTCTGTGTCTGCGCTGTTCATTTCAGTAGCCACCAGCCACCTGTGG 
CCGTTGAGTGCTTGAAATGAGGAACTGAGAAAATTAATTTCTCATGTATTTTTCTCATTTATTTA 
TTAATTTTTAACTGATAGTTGTACATATTTGGGGGTACATGTGATATTTGGATACATGTATACAA
TATATAATGATCAAATCAGGGTAACTGGGATATCCATCACATCAAACATTTATTTTTTATTCTTT
TTAGACAGAGTCTCACTCTGTCACCCAGGCTGGAGTGCAGTGGTGCCATCTCAGCTTACTGGAAC
CTCTGCCTGCCAGGTTCAAGCGATTCTCATGCCTCCACCTCCCAAGTAGCTGGGACTACAGGCAT
GCACCACAATGCCCAACTAATTTTTGTATTTTTAGTAGAGACGGGGTTTTGCCATGTTGCCCAGG
CTGGCCTTGAACTCCTGGCCTCAAACAATCCACTTGCCTCGGCCTCCCAAAGTGTTATGATTACA
GGCGTGAGCCACCGTGCCTGGCCTAAACATTTATCTTTTCTTTGTGTTGGGAACTTTGAAATTAT
ACAATGAATTATTGTTAACTGTCATCTCCCTGCTGTGCTATGGAACACTGGGACTTCTTCCCTCT
ATCTAACTGTATATTTGTACCAGTTAACCAACCGTACTTCATCCCCACTCCTCTCTATCCTTCCC
AACCTCTGATCACCTCATTCTACTCTCTACCTCCATGAGATCCACTTTTTTAGCTCCCACATGTG
AGTAAGAAAATGCAATATTTGTCTTTCTGTGCCTGGCTTATTTCACTTAACATAATGACTTCCTG
TTCCATCCATGTTGCTGCAAATGACAGGATTTCGTTCTTAATTTCAATTAAAATAACCACACATG
GCAAAAA



App_ID=10063602

FIGURE 130

MGLLLLVLFLSLLPVAYTIMSLPPSFDCGPFRCRVSVAREHLPSRGSLLRGPRPRIPVLVSCQPV 
KGHGTLGESPMPFKRVFCQDGNVRSFCVCAVHFSSHQPPVAVECLK 

Important features of the protein: 
Signal peptide: 

amino acids 1-18 

N-myristoylation site. 

amino acids 86-92 

Zinc carboxypeptidases, zinc-binding region 2 signature.

amino acids 68-79 
FIGURE 131



TTCTGAAGTAACGGAAGCTACCTTGTATAAAGACCTCAACACTGCTGACCATGATCAGCGCAGCCTGGAGC
ATCTTCCTCATCGGGACTAAAATTGGGCTGTTCCTTCAAGTAGCACCTCTATCAGTTATGGCTAAATCCTG 
TCCATCTGTGTGTCGCTGCGATGCGGGTTTCATTTACTGTAATGATCGCTTTCTGACATCCATTCCAACAG 
GAATACCAGAGGATGCTACAACTCTCTACCTTCAGAACAACCAAATAAATAATGCTGGGATTCCTTCAGAT
TTGAAAAACTTGCTGAAAGTAGAAAGAATATACCTATACCACAACAGTTTAGATGAATTTCCTACCAACCT
CCCAAAGTATGTAAAAGAGTTACATTTGCAAGAAAATAACATAAGGACTATCACTTATGATTCACTTTCAA
AAATTCCCTATCTGGAAGAATTACATTTAGATGACAACTCTGTCTCTGCAGTTAGCATAGAAGAGGGAGCA
TTCCGAGACAGCAACTATCTCCGACTGCTTTTCCTGTCCCGTAATCACCTTAGCACAATTCCCTGGGGTTT
GCCCAGGACTATAGAAGAACTACGCTTGGATGATAATCGCATATCCACTATTTCATCACCATCTCTTCAAG
GTCTCACTAGTCTAAAACGCCTGGTTCTAGATGGAAACCTGTTGAACAATCATGGTTTAGGTGACAAAGTT
TTCTTCAACCTAGTTAATTTGACAGAGCTGTCCCTGGTGCGGAATTCCCTGACTGCTGCACCAGTAAACCT
TCCAGGCACAAACCTGAGGAAGCTTTATCTTCAAGATAACCACATCAATCGGGTGCCCCCAAATGCTTTTT
cttatctaaggcagctctatcgactggatatgtccaataataacctaagtaatttacctcagggtatcttt
gatgatttggacaatataacacaactgattcttcgcaacaatccctggtattgcgggtgcaagatgaaatg
ggtacgtgactggttacaatcactacctgtgaaggtcaacgtgcgtgggctcatgtgccaagccccagaaa
aggttcgtgggatggctattaaggatctcaatgcagaactgtttgattgtaaggacagtgggattgtaagc
accattcagataaccactgcaatacccaacacagtgtatcctgcccaaggacagtggccagctccagtgac
caaacagccagatattaagaaccccaagctcactaaggatcaacaaaccacagggagtccctcaagaaaaa
caattacaattactgtgaagtctgtcacctctgataccattcatatctcttggaaacttgctctacctatg 
actgctttgagactcagctggcttaaactgggccatagcccggcatttggatctataacagaaacaattgt 
aacaggggaacgcagtgagtacttggtcacagccctggagcctgattcaccctataaagtatgcatggttc 
ccatggaaaccagcaacctctacctatttgatgaaactcctgtttgtattgagactgaaactgcacccctt 
cgaatgtacaaccctacaaccaccctcaatcgagagcaagagaaagaaccttacaaaaaccccaatttacc 
tttggctgccatcattggtggggctgtggccctggttaccattgcccttcttgctttagtgtgttggtatg 
ttcataggaatggatcgctcttctcaaggaactgtgcatatagcaaagggaggagaagaaaggatgactat 
gcagaagctggcactaagaaggacaactctatcctggaaatcagggaaacttcttttcagatgttaccaat 
aagcaatgaacccatctcgaaggaggagtttgtaatacacaccatatttcctcctaatggaatgaatctgt 
acaaaaacaatcacagtgaaagcagtagtaaccgaagctacagagacagtggtattccagactcagatcac 
tcacactcatgatgctgaaggactcacagcagacttgtgttttgggttttttaaacctaagggaggtgatg 

GT 
FIGURE 132



MISAAWSIFLIGTKIGLFLQVAPLSVMAKSCPSVCRCDAGFIYCNDRFLTSIPTGIPEDATTLYL
QNNQINNAGIPSDLKNLLKVERIYLYHNSLDEFPTNLPKYVKELHLQENNIRTITYDSLSKIPYL
EELHLDDNSVSAVSIEEGAFRDSNYLRLLFLSRNHLSTIPWGLPRTIEELRLDDNRISTISSPSL
QGLTSLKRLVLDGNLLNNHGLGDKVFFNLVNLTELSLVRNSLTAAPVNLPGTNLRKLYLQDNHIN
RVPPNAFSYLRQLYRLDMSNNNLSNLPQGIFDDLDNITQLILRNNPWYCGCKMKWVRDWLQSLPV
KVNVRGLMCQAPEKVRGMAIKDLNAELFDCKDSGIVSTIQITTAIPNTVYPAQGQWPAPVTKQPD
IKNPKLTKDQQTTGSPSRKTITITVKSVTSDTIHISWKLALPMTALRLSWLKLGHSPAFGSITET
IVTGERSEYLVTALEPDSPYKVCMVPMETSNLYLFDETPVCIETETAPLRMYNPTTTLNREQEKE
PYKNPNLPLAAIIGGAVALVTIALLALVCWYVHRNGSLFSRNCAYSKGRRRKDDYAEAGTKKDNS
ILEIRETSFQMLPISNEPISKEEFVIHTIFPPNGMNLYKNNHSESSSNRSYRDSGIPDSDHSHS

Important features of the protein:
Signal peptide:

amino acids 1-28

Transmembrane domain:

amino acids 531-552

N-glycosylation sites.

amino acids 226-229, 282-285, 296-299, 555-558, 626-629, 633-636

Tyrosine kinase phosphorylation site.

amino acids 515-522

N-myristoylation sites.

amino acids 12-17, 172-177, 208-213, 359-364, 534-539, 556-561,
640-645

Amidation site.

amino acids 567-570

Leucine zipper pattern.

amino acids 159-180

Phospholipase A2 aspartic acid active site.

amino acids 34-44
FIGURE 133



CCGTCATCCCCCTGCAGCCACCCTTCCCAGAGTCCTTTGCCCAGGCCACCCCAGGCTTCTTGGCA 
GCCCTGCCGGGCCACTTGTCTTCATGTCTGCCAGGGGGAGGTGGGAAGGAGGTGGGAGGAGGGCG 
TGCAGAGGCAGTCTGGGCTTGGCCAGAGCTCAGGGTGCTGAGCGTGTGACCAGCAGTGAGCAGAG 
GCCGGCCATGGCCAGCCTGGGGCTGCTGCTCCTGCTCTTACTGACAGCACTGCCACCGCTGTGGT 
CCTCCTCACTGCCTGGGCTGGACACTGCTGAAAGTAAAGCCACCATTGCAGACCTGATCCTGTCT 
GCGCTGGAGAGAGCCACCGTCTTCCTAGAACAGAGGCTGCCTGAAATCAACCTGGATGGCATGGT 
GGGGGTCCGAGTGCTGGAAGAGCAGCTAAAAAGTGTCCGGGAGAAGTGGGCCCAGGAGCCCCTGC 
TGCAGCCGCTGAGCCTGCGCGTGGGGATGCTGGGGGAGAAGCTGGAGGCTGCCATCCAGAGATCC 
CTCCACTACCTCAAGCTGAGTGATCCCAAGTACCTAAGAGAGTTCCAGCTGACCCTCCAGCCCGG 
GTTTTGGAAGCTCCCACATGCCTGGATCCACACTGATGCCTCCTTGGTGTACCCCACGTTCGGGC 
CCCAGGACTCATTCTCAGAGGAGAGAAGTGACGTGTGCCTGGTGCAGCTGCTGGGAACCGGGACG
GACAGCAGCGAGCCCTGCGGCCTCTCAGACCTCTGCAGGAGCCTCATGACCAAGCCCGGCTGCTC
AGGCTACTGCCTGTCCCACCAACTGCTCTTCTTCCTCTGGGCCAGAATGAGGGGATGCACACAGG
GACCACTCCAACAGAGCCAGGACTATATCAACCTCTTCTGCGCCAACATGATGGACTTGAACCGC
AGAGCTGAGGCCATCGGATACGCCTACCCTACCCGGGACATCTTCATGGAAAACATCATGTTCTG
TGGAATGGGCGGCTTCTCCGACTTCTACAAGCTCCGGTGGCTGGAGGCCATTCTCAGCTGGCAGA
AACAGCAGGAAGGATGCTTCGGGGAGCCTGATGCTGAAGATGAAGAATTATCTAAAGCTATTCAA
TATCAGCAGCATTTTTCGAGGAGAGTGAAGAGGCGAGAAAAACAATTTCCAGATTCTCGCTCTGT
TGCTCAGGCTGGAGTACAGTGGCGCAATCTCGGCTCACTGCAACCTTTGCCTCCTGGGTTCAAGC
AATTCTCTTGCCTCATCCTCCCGAGTAGCTGGGACTACAGGAGCGTGCCACCATACCTGGCTAAT
TTTTATATTTTTTTAGTAGAGACAGGGTTTCATCATGTTGCTCATGCTGGTCTCGAACTCCTGAT 
CTCAAGAGATCCGCCCACCTCAGGCTCCCAAAGTGTGGGATTATAGGTGTGAGCCACCGTGTCTG 
GCTGAAAAGCACTTTCAAAGAGACTGTGTTGAATAAAGGGCCAAGGTTCTTGCCACCCAGCACTC 
ATGGGGGCTCTCTCCCCTAGATGGCTGCTCCTCCCACAACACAGCCACAGCAGTGGCAGCCCTGG 
GTGGCTTCCTATACATCCTGGCAGAATACCCCCCAGCAAACAGAGAGCCACACCCATCCACACCG 
CCACCACCAAGCAGCCGCTGAGACGGACGGTTCCATGCCAGCTGCCTGGAGGAGGAACAGACCCC 
TTTAGTCCTCATCCCTTAGATCCTGGAGGGCACGGATCACATCCTGGGAAGAAGGCATCTGGAGG 
ATAAGCAAAGCCACCCCGACACCCAATCTTGGAAGCCCTGAGTAGGCAGGGCCAGGGTAGGTGGG 
GGCCGGGAGGGACCCAGGTGTGAACGGATGAATAAAGTTCAACTGCAACTGAAAAAAAAAAA
FIGURE 134



MSARGRWEGGGRRACRGSLGLARAQGAERVTSSEQRPAMASLGLLLLLLLTALPPLWSSSLPGLD
TAESKATIADLILSALERATVFLEQRLPEINLDGMVGVRVLEEQLKSVREKWAQEPLLQPLSLRV 
GMLGEKLEAAIQRSLHYLKLSDPKYLREFQLTLQPGFWKLPHAWIHTDASLVYPTFGPQDSFSEE 
RSDVCLVQLLGTGTDSSEPCGLSDLCRSLMTKPGCSGYCLSHQLLFFLWARMRGCTQGPLQQSQD 
YINLFCANMMDLNRRAEAIGYAYPTRDIFMENIMFCGMGGFSDFYKLRWLEAILSWQKQQEGCFG 
EPDAEDEELSKAIQYQQHFSRRVKRREKQFPDSRSVAQAGVQWRNLGSLQPLPPGFKQFSCLILP 
SSWDYRSVPPYLANFYIFLVETGFHHVAHAGLELLISRDPPTSGSQSVGL 

Important features of the protein:
Signal peptide:

amino acids 1-26

Transmembrane domain:

amino acids 39-56

Tyrosine kinase phosphorylation sites.

amino acids 149-156, 274-282

N-myristoylation sites.

amino acids 10-16, 20-26, 63-69, 203-214

Amidation site.

amino acids 10-14

Glycoprotein hormones beta chain signature 1.

amino acids 230-237
FIGURE 135



GGTCTGAGTGCAGAGCTGCTGTCATGGCGGCCGCTCTGTGGGGCTTCTTTCCCGTCCTGCTGCTG
CTGCTGCTATCGGGGGATGTCCAGAGCTCGGAGGTGCCCGGGGCTGCTGCTGAGGGATCGGGAGG 
GAGTGGGGTCGGCATAGGAGATCGCTTCAAGATTGAGGGGCGTGCAGTTGTTCCAGGGGTGAAGC 
CTCAGGACTGGATCTCGGCGGCCCGAGTGCTGGTAGACGGAGAAGAGCACGTCGGTTTCCTTAAG 
ACAGATGGGAGTTTTGTGGTTCATGATATACCTTCTGGATCTTATGTAGTGGAAGTTGTATCTCC 
AGCTTACAGATTTGATCCCGTTCGAGTGGATATCACTTCGAAAGGAAAAATGAGAGCAAGATATG 
TGAATTACATCAAAACATCAGAGGTTGTCAGACTGCCCTATCCTCTCCAAATGAAATCTTCAGGT 
CCACCTTCTTACTTTATTAAAAGGGAATCGTGGGGCTGGACAGACTTTCTAATGAACCCAATGGT
TATGATGATGGTTCTTCCTTTATTGATATTTGTGCTTCTGCCTAAAGTGGTCAACACAAGTGATC
CTGACATGAGACGGGAAATGGAGCAGTCAATGAATATGCTGAATTCCAACCATGAGTTGCCTGAT
GTTTCTGAGTTCATGACAAGACTCTTCTCTTCAAAATCATCTGGCAAATCTAGCAGCGGCAGCAG 
TAAAACAGGCAAAAGTGGGGCTGGCAAAAGGAGGTAGTCAGGCCGTCCAGAGCTGGCATTTGCAC
AAACACGGCAACACTGGGTGGCATCCAAGTCTTGGAAAACCGTGTGAAGCAACTACTATAAACTT 
GAGTCATCCCGACGTTGATCTCTTACAACTGTGTATGTT 

AACTTTTTAGCACATGTTTTGTACTTGGTACACGAGAAAACCCAGCTTTCATCTTTTGTCTGTAT 
GAGGTCAATATTGATGTCACTGAATTAATTACAGTGTCCTATAGAAAATGCCATTAATAAATTAT
ATGAACTACTATACATTATGTATATTAATTAAAACATCTTAATCCAGAAATCAAAAAAAAAAAAA
AAAAAAAAAAAAAAA
FIGURE 136



MAAALWGFFPVLLLLLLSGDVQSSEVPGAAAEGSGGSGVGIGDRFKIEGRAVVPGVKPQDWISAA
RVLVDGEEHVGFLKTDGSFVVHDIPSGSYVVEVVSPAYRFDPVRVDITSKGKMRARYVNYIKTSE
VVRLPYPLQMKSSGPPSYFIKRESWGWTDFLMNPMVMMMVLPLLIFVLLPKVVNTSDPDMRREME
QSMNMLNSNHELPDVSEFMTRLFSSKSSGKSSSGSSKTGKSGAGKRR



Important features of the protein:
Signal sequence:

amino acids 1-23

Transmembrane domain:

amino acids 161-182

N-glycosylation site.

amino acids 184-187

Glycosaminoglycan attachment sites.

amino acids 37-40, 236-239

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 151-154

N-myristoylation sites.

amino acids 33-38, 36-41, 38-44, 229-234

Amidation site.

amino acids 238-241

ATP/GTP-binding site motif A (P-loop).

amino acids 229-236
FIGURE 137



GATGGCGCAGCCACAGCTTCTGTGAGATTCGATTTCTCCCCAGTTCCCCTGTGGGTCTGAGGGGA 
CCAGAAGGGTGAGCTACGTTGGCTTTCTGGAAGGGGAGGCTATATGCGTCAATTCCCCAAAACAA 
GTTTTGACATTTCCCCTGAAATGTCATTCTCTATCTATTCACTGCAAGTGCCTGCTGTTCCAGGC 
CTTACCTGCTGGGCACTAACGGCGGAGCCAGGATGGGGACAGAATAAAGGAGCCACGACCTGTGC 
CACCAACTCGCACTCAGACTCTGAACTCAGACCTGAAATCTTCTCTTCACGGGAGGCTTGGCAGT 
TTTTCTTACTCCTGTGGTCTCCAGATTTCAGGCCTAAGATGAAAGCCTCTAGTCTTGCCTTCAGC
CTTCTCTCTGCTGCGTTTTATCTCCTATGGACTCCTTCCACTGGACTGAAGACACTCAATTTGGG 
AAGCTGTGTGATCGCCACAAACCTTCAGGAAATACGAAATGGATTTTCTGAGATACGGGGCAGTG
TGCAAGCCAAAGATGGAAACATTGACATCAGAATCTTAAGGAGGACTGAGTCTTTGCAAGACACA
AAGCCTGCGAATCGATGCTGCCTCCTGCGCCATTTGCTAAGACTCTATCTGGACAGGGTATTTAA 
AAACTACCAGACCCCTGACCATTATACTCTCCGGAAGATCAGCAGCCTCGCCAATTCCTTTCTTA 
CCATCAAGAAGGACCTCCGGCTCTCTCATGCCCACATGACATGCCATTGTGGGGAGGAAGCAATG 
AAGAAATACAGCCAGATTCTGAGTCACTTTGAAAAGCTGGAACCTCAGGCAGCAGTTGTGAAGGC
TTTGGGGGAACTAGACATTCTTCTGCAATGGATGGAGGAGACAGAATAGGAGGAAAGTGATGCTG 
CTGCTAAGAATATTCGAGGTCAAGAGCTCCAGTCTTCAATACCTGCAGAGGAGGCATGACCCCAA 
ACCACCATCTCTTTACTGTACTAGTCTTGTGCTGGTCACAGTGTATCTTATTTATGCATTACTTG 
CTTCCTTGCATGATTGTCTTTATGCATCCCCAATCTTAATTGAGACCATACTTGTATAAGATTTT 
TGTAATATCTTTCTGCTATTGGATATATTTATTAGTTAATATATTTATTTATTTTTTGCTATTTA 
ATGTATTTATTTTTTTACTTGGACATGAAACTTTAAAAAAATTCACAGATTATATTTATAACCTG
ACTAGAGCAGGTGATGTATTTTTATACAGTAAAAAAAAAAAACCTTGTAAATTCTAGAAGAGTGG
CTAGGGGGGTTATTCATTTGTATTCAACTAAGGACATATTTACTCATGCTGATGCTCTGTGAGAT 
ATTTGAAATTGAACCAATGACTACTTAGGATGGGTTGTGG7VATAAGTTTTGATGTGGAATTGCAC 
ATCTACCTTACAATTACTGACCATCCCCAGTAGACTCCCCAGTCCCATAATTGTGTATCTTCCAG
CCAGGAATCCTACACGGCCAGCATGTATTTCTACAAATAAAGTTTTCTTTGCATACCAAAAAAAA 
AAAAAAAAAAA 
FIGURE 138



MRQFPKTSFDISPEMSFSIYSLQVPAVPGLTCWALTAEPGWGQNKGATTCATNSHSDSELRPEIF 
SSREAWQFFLLLWSPDFRPKMKASSLAFSLLSAAFYLLWTPSTGLKTLNLGSCVIATNLQEIRNG 
FSEIRGSVQAKDGNIDIRILRRTESLQDTKPANRCCLLRHLLRLYLDRVFKNYQTPDHYTLRKIS 
SLANSFLTIKKDLRLSHAHMTCHCGEEAMKKYSQILSHFEKLEPQAAVVKALGELDILLQWMEET
E

Important features of the protein:
Signal peptide:

amino acids 1-42

cAMP- and cGMP-dependent protein kinase phosphorylation sites.

amino acids 192-195, 225-228

N-myristoylation sites.

amino acids 42-47, 46-51, 136-141
FIGURE 139



CCTGGAGCCGGAAGCGCGGCTGCAGCAGGGCGAGGCTCCAGGTGGGGTCGGTTCCGCATCCAGCC 
TAGCGTGTCCACGATGCGGCTGGGCTCCGGGACTTTCGCTACCTGTTGCGTAGCGATCGAGGTGC 
TAGGGATCGCGGTCTTCCTTCGGGGATTCTTCCCGGCTCCCGTTCGTTCCTCTGCCAGAGCGGAA 
CACGGAGCGGAGCCCCCAGCGCCCGAACCCTCGGCTGGAGCCAGTTCTAACTGGACCACGCTGCC 
ACCACCTCTCTTCAGTAAAGTTGTTATTGTTCTGATAGATGCCTTGAGAGATGATTTTGTGTTTG 
GGTCAAAGGGTGTGAAATTTATGCCCTACACAACTTACCTTGTGGAAAAAGGAGCATCTCACAGT 
TTTGTGGCTGAAGCAAAGCCACCTACAGTTACTATGCCTCGAATCAAGGCATTGATGACGGGGAG
CCTTCCTGGCTTTGTCGACGTCATCAGGAACCTCAATTCTCCTGCACTGCTGGAAGACAGTGTGA
TAAGACAAGCAAAAGCAGCTGGAAAAAGAATAGTCTTTTATGGAGATGAAACCTGGGTTAAATTA
TTCCCAAAGCATTTTGTGGAATATGATGGAACAACCTCATTTTTCGTGTCAGATTACACAGAGGT
GGATAATAATGTCACGAGGCATTTGGATAAAGTATTAAAAAGAGGAGATTGGGACATATTAATCC
TCCACTACCTGGGGCTGGACCACATTGGCCACATTTCAGGGCCCAACAGCCCCCTGATTGGGCAG
AAGCTGAGCGAGATGGACAGCGTGCTGATGAAGATCCACACCTCACTGCAGTCGAAGGAGAGAGA
GACGCCTTTACCCAATTTGCTGGTTCTTTGTGGTGACCATGGCATGTCTGAAACAGGAAGTCACG 
GGGCCTCCTCCACCGAGGAGGTGAATACACCTCTGATTTTAATCAGTTCTGCGTTTGAAAGGAAA 
CCCGGTGATATCCGACATCCAAAGCACGTCCAATAGACGGATGTGGCTGCGACACTGGCGATAGC 
ACTTGGCTTACCGATTCCAAAAGACAGTGTAGGGAGCCTCCTATTCCCAGTTGTGGAAGGAAGAC
CAATGAGAGAGCAGTTGAGATTTTTACATTTGAATACAGTGCAGCTTAGTAAACTGTTGCAAGAG
AATGTGCCGTCATATGAAAAAGATCCTGGGTTTGAGCAGTTTAAAATGTCAGAAAGATTGCATGG
GAACTGGATCAGACTGTACTTGGAGGAAAAGCATTCAGAAGTCCTATTCAACCTGGGCTCCAAGG
TTCTCAGGCAGTACCTGGATGCTCTGAAGACGCTGAGCTTGTCCCTGAGTGCACAAGTGGCCCAG
TTCTCACCCTGCTCCTGCTCAGCGTCCCACAGGCACTGCACAGAAAGGCTGAGCTGGAAGTCCCA
CTGTCATCTCCTGGGTTTTCTCTGCTCTTTTATTTGGTGATCCTGGTTCTTTCGGCCGTTCACGT
CATTGTGTGCACCTCAGCTGAAAGTTCGTGCTACTTCTGTGGCCTCTCGTGGCTGGCGGCAGGCT
GCCTTTCGTTTACCAGACTCTGGTTGAACACCTGGTGTGTGCCAAGTGCTGGCAGTGCCCTGGAC
AGGGGGCCTCAGGGAAGGACGTGGAGCAGCCTTATCCCAGGCCTCTGGGTGTCCCGACACAGGTG
TTCACATCTGTGCTGTCAGGTCAGATGCCTCAGTTCTTGGAAAGCTAGGTTCCTGCGACTGTTAC
CAAGGTGATTGTAAAGAGCTGGCGGTCACAGAGGAACAAGCCCCCCAGCTGAGGGGGTGTGTGAA
TCGGACAGCCTCCCAGCAGAGGTGTGGGAGCTGCAGCTGAGGGAAGAAGAGACAATCGGCCTGGA
CACTCAGGAGGGTCAAAAGGAGACTTGGTCGCACCACTCATCCTGCCACCCCCAGAATGCATCCT
GCCTCATCAGGTCCAGATTTCTTTCCAAGGCGGACGTTTTCTGTTGGAATTCTTAGTCCTTGGCC
TCGGACACCTTCATTCGTTAGCTGGGGAGTGGTGGTGAGGCAGTGAAGAAGAGGCGGATGGTCAC
ACTCAGATCCACAGAGCCCAGGATCAAGGGACCCACTGCAGTGGCAGCAGGACTGTTGGGCCCCC
ACCCCAACCCTGCACAGCCCTCATCCCCTCTTGGCTTGAGCCGTCAGAGGCCCTGTGCTGAGTGT
CTGACCGAGACACTCACAGCTTTGTCATCAGGGCACAGGCTTCCTCGGAGCCAGGATGATCTGTG
CCACGCTTGCACCTCGGGCCCATCTGGGCTCATGCTCTCTCTCCTGCTATTGAATTAGTACCTAG
CTGCACACAGTATGTAGTTACCAAAAGAATAAACGGCAATAATTGAGAAAAAAAA



FIGURE 140

MRLGSGTFATCCVAIEVLGIAVFLRGFFPAPVRSSARAEHGAEPPAPEPSAGASSNWTTLPPPLF 
SKVVIVLIDALRDDFVFGSKGVKFMPYTTYLVEKGASHSFVAEAKPPTVTMPRIKALMTGSLPGF
VDVIRNLNSPALLEDSVIRQAKAAGKRIVFYGDETWVKLFPKHFVEYDGTTSFFVSDYTEVDNNV 
TRHLDKVLKRGDWDILILHYLGLDHIGHISGPNSPLIGQKLSEMDSVLMKIHTSLQSKERETPLP 
NLLVLCGDHGMSETGSHGASSTEEVNTPLILISSAFERKPGDIRHPKHVQ

Important features of the protein:
Signal peptide:

amino acids 1-34

Transmembrane domain:

amino acids 58-76

N-glycosylation sites.

amino acids 56-60, 194-198

N-myristoylation sites.

amino acids 6-12, 52-58, 100-106, 125-131, 233-239, 270-276,
275-281, 278-284

Amidation site.

amino acids 154-158

Cell attachment sequence.

amino acids 205-208
FIGURE 141



GGCACGAGGCAAGCCTTCCAGGTTATCGTGACGCACCTTGAAAGTCTGAGAGCTACTGCCCTACA 
GAAAGTTACTAGTGCCCTAAAGCTGGCGCTGGCACTGATGTTACTGCTGCTGTTGGAGTACAACT 
TCCCTATAGAAAACAACTGCCAGCACCTTAAGACCACTCACACCTTCAGAGTGAAGAACTTAAAC
CCGAAGAAATTCAGCATTCATGACCAGGATCACAAAGTACTGGTCCTGGACTCTGGGAATCTCAT 
AGCAGTTCCAGATAAAAACTACATACGCCCAGAGATCTTCTTTGCATTAGCCTCATCCTTGAGCT 
CAGCCTCTGCGGAGAAAGGAAGTCCGATTCTCCTGGGGGTCTCTAAAGGGGAGTTTTGTCTCTAC 
TGTGACAAGGATAAAGGACAAAGTCATCCATCCCTTCAGCTGAAGAAGGAGAAACTGATGAAGCT 
GGCTGCCCAAAAGGAATCAGCACGCCGGCCCTTCATCTTTTATAGGGCTCAGGTGGGCTCCTGGA 
ACATGCTGGAGTCGGCGGCTCACCCCGGATGGTTCATCTGCACCTCCTGCAATTGTAATGAGCCT 
GTTGGGGTGACAGATAAATTTGAGAACAGGAAACACATTGAATTTTCATTTCAACCAGTTTGCAA
AGCTGAAATGAGCCCCAGTGAGGTCAGCGATTAGGAAACTGCCCCATTGAACGCCTTCCTCGCTA
ATTTGAACTAATTGTATAAAAACACCAAACCTGCTCACT

FIGURE 142

MLLLLLEYNFPIENNCQHLKTTHTFRVKNLNPKKFSIHDQDHKVLVLDSGNLIAVPDKNYIRPEI 
FFALASSLSSASAEKGSPILLGVSKGEFCLYCDKDKGQSHPSLQLKKEKLMKLAAQKESARRPFI
FYRAQVGSWNMLESAAHPGWFICTSCNCNEPVGVTDKFENRKHIEFSFQPVCKAEMSPSEVSD 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 33-36

N-myristoylation site.

amino acids 50-55, 87-92 

Interleukin-1 

amino acids 37-182

FIGURE 143



CTAGAGAGTATAGGGCAGAAGGATGGCAGATGAGTGACTCCACATCCAGAGCTGCCTCCCTTTAA
TCCAGGATCCTGTCCTTCCTGTCCTGTAGGAGTGCCTGTTGCCAGTGTGGGGTGAGACAAGTTTG 
TCCCACAGGGCTGTCTGAGCAGATAAGATTAAGGGCTGGGTCTGTGCTCAATTAACTCCTGTGGG 
CACGGGGGCTGGGAAGAGCAAAGTCAGCGGTGCCTACAGTCAGCACCATGCTGGGCCTGCCGTGG 
AAGGGAGGTCTGTCCTGGGCGCTGCTGCTGCTTCTCTTAGGCTCCCAGATCCTGCTGATCTATGC 
CTGGCATTTCCACGAGCAAAGGGACTGTGATGAACACAATGTCATGGCTCGTTACCTCCCTGCCA
CAGTGGAGTTTGCTGTCCACACATTCAACCAACAGAGCAAGGACTACTATGCCTACAGACTGGGG
CACATCTTGAATTCCTGGAAGGAGCAGGTGGAGTCCAAGACTGTATTCTCAATGGAGCTACTGCT
GGGGAGAACTAGGTGTGGGAAATTTGAAGACGACATTGACAACTGCCATTTCCAAGAAAGCACAG
AGCTGAACAATACTTTCACCTGCTTCTTCACCATCAGCACCAGGCCCTGGATGACTCAGTTCAGC
CTCCTGAACAAGACCTGCTTGGAGGGATTCCACTGAGTGAAACCCACTCACAGGCTTGTCCATGT
GCTGCTCCCACATTCCGTGGACATCAGCACTACTCTCCTGAGGACTCTTCAGTGGCTGAGCAGCT
TTGGACTTGTTTGTTATCCTATTTTGCATGTGTTTGAGATCTCAGATCAGTGTTTTAGAAAATCC
ACACATCTTGAGCCTAATCATGTAGTGTAGATCATTAAACATCAGCATTTTAAGAAAAAAAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 






FIGURE 144 



MLGLPWKGGLSWALLLLLLGSQILLIYAWHFHEQRDCDEHNVMARYLPATVEFAVHTFNQQSKDY 
YAYRLGHILNSWKEQVESKTVFSMELLLGRTRCGKFEDDIDNCHFQESTELNWTFTCFFTISTRP 
WMTQFSLLNKTCLEGFH 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 



N-glycosylation sites.

amino acids 117-121, 139-143



N-myristoylation site. 

amino acids 9-15

FIGURE 145



CTGTGCAGCTCGAGGCTCCAGAGGCACACTCCAGAGAGAGCCAAGGTTCTGACGCGATGAGGAAG 
CACCTGAGCTGGTGGTGGCTGGCCACTGTCTGCATGCTGCTCTTCAGCCACCTCTCTGCGGTCCA 
GACGAGGGGCATCAAGCACAGAATCAAGTGGAACCGGAAGGCCCTGCCCAGCACTGCCCAGATCA 
CTGAGGCCCAGGTGGCTGAGAACCGCCCGGGAGCCTTCATCAAGCAAGGCCGCAAGCTCGACATT 
GACTTCGGAGCCGAGGGCAACAGGTACTACGAGGCCAACTACTGGCAGTTCCCCGATGGCATCCA
CTACAACGGCTGCTCTGAGGCTAATGTGACCAAGGAGGCATTTGTCACCGGCTGCATCAATGCCA 
CCCAGGCGGCGAACCAGGGGGAGTTCCAGAAGCCAGACAACAAGCTCCACCAGCAGGTGCTCTGG 
CGGCTGGTCCAGGAGCTCTGCTCCCTCAAGCATTGCGAGTTTTGGTTGGAGAGGGGCGCAGGACT
TCGGGTCACCATGCACCAGCCAGTGCTCCTCTGCCTTCTGGCTTTGATCTGGCTCATGGTGAAAT
AAGCTTGCCAGGAGGCTGGCAGTACAGAGCGCAGCAGCGAGCAAATCCTGGCAAGTGACCCAGCT
CTTCTCCCCCAAACCCACGCGTGTTCTGAAGGTGCCCAGGAGCGGCGATGCACTCGCACTGCAAA
TGCCGCTCCCACGTATGCGCCCTGGTATGTGCCTGCGTTCTGATAGATGGGGGACTGTGGCTTCT
CCGTCACTCCATTCTCAGCCCCTAGCAGAGCGTCTGGCACACTAGATTAGTAGTAAATGCTTGAT
GAGAAGAACACATCAGGCACTGCGCCACCTGCTTCACAGTACTTCCCAACAACTCTTAGAGGTAG
GTGTATTCCCGTTTTACAGATAAGGAAACTGAGGCCCAGAGAGCTGAAGTACTGCACCCAGCATC
ACCAGCTAGAAAGTGGCAGAGCCAGGATTCAACCCTGGCTTGTCTAACCCCAGGTTTTCTGCTCT
GTCCAATTCCAGAGCTGTCTGGTGATCACTTTATGTCTCACAGGGACCCACATCCAAACATGTAT
CTCTAATGAAATTGTGAAAGCTCCATGTTTAGAAATAAATGAAAACACCTGA

FIGURE 146



MRKHLSWWWLATVCMLLFSHLSAVQTRGIKHRIKWNRKALPSTAQITEAQVAENRPGAFIKQGRK 
LDIDFGAEGNRYYEANYWQFPDGIHYNGCSEANVTKEAFVTGCINATQAANQGEFQKPDNKLHQQ 
VLWRLVQELCSLKHCEFWLERGAGLRVTMHQPVLLCLLALIWLMVK 

Important features of the protein: 
Signal peptide: 

amino acids 1-26 



Transmembrane domain:

amino acids 157-171



N-glycosylation sites. 

amino acids 98-102, 110-114

Tyrosine kinase phosphorylation site. 

amino acids 76-83 

N-myristoylation sites.

amino acids 71-77, 88-94, 93-99, 107-113, 154-160

Amidation site. 

amino acids 62-66

FIGURE 147

GCCTTGGCCTCCCAAAGGGCTGGGATTATAGGCGTGACCACCATGTCTGGTCCAGAGTCTCATTT 
CCTGATGATTTATAGACTCAAAGAAAACTC ATG TTCAGAAGCTCTCTTCTCTTCTGGCCTCCTCT 
CTGTCTTCTTTCCCTCTTTCTTCTTATTTTAATTAGTAGCATCTACTCAGAGTCATGCAAGCTGG 
AAATCTTTCATTTTGCTTGTCAGTGGGGTAGGTCACTGAGTCTTAGTTTTTATTTTTTGAAATTT 
CAACTTTCAGATTCAGGGGGTAGATGTGAAGGTTTGTTTTATGAGTATATTGCATGATGCTGAGG
TTTGGGGT

FIGURE 148






MFRSSLLFWPPLCLLSLFLLILISSIYSESCKLEIFHFACQWGRSLSLSFYFLKFQLSDSGGTCE 
GLFYEYIA 

Important features of the protein: 
Signal peptide: 

amino acids 1-25 

N-myristoylation site. 

amino acids 62-68

FIGURE 149



GTCTCCGCGTCACAGGAACTTCAGCACCCACAGGGCGGACAGCGCTCCCCTCTACCTGGAGACTTGAC
TCCCGCGCGCCCCAACCCTGCTTATCCCTTGACCGTCGAGTGTCAGAGATCCTGCAGCCGCCCAGTCC 
CGGCCCCTCTCCCGCCCCACACCCACCCTCCTGGCTCTTCCTGTTTTTACTCCTCCTTTTCATTCATA 
ACAAAAGCTACAGCTCCAGGAGCCCAGCGCCGGGCTGTGACCCAAGCCGAGCGTGGAAGAATGGGGTT 
CCTCGGGACCGGCACTTGGATTCTGGTGTTAGTGCTCCCGATTCAAGCTTTCCCCAAACCTGGAGGAA 
GCCAAGACAAATCTCTACATAATAGAGAATTAAGTGCAGAAAGACCTTTGAATGAACAGATTGCTGAA 
GCAGAAGAAGACAAGATTAAAAAAACATATCCTCCAGAAAACAAGCCAGGTCAGAGCAACTATTCTTT
TGTTGATAACTTGAACCTGCTAAAGGCAATAACAGAAAAGGAAAAAATTGAGAAAGAAAGACAATCTA
TAAGAAGCTCCCCACTTGATAATAAGTTGAATGTGGAAGATGTTGATTCAACCAAGAATCGAAAACTG
ATCGATGATTATGACTCTACTAAGAGTGGATTGGATCATAAATTTCAAGATGATCCAGATGGTCTTCA
TCAACTAGACGGGACTCCTTTAACCGCTGAAGACATTGTCCATAAAATCGCTGCCAGGATTTATGAAG
AAAATGACAGAGCCGTGTTTGACAAGATTGTTTCTAAACTACTTAATCTCGGCCTTATCACAGAAAGC
CAAGCACATACACTGGAAGATGAAGTAGCAGAGGTTTTACAAAAATTAATCTCAAAGGAAGCCAACAA
TTATGAGGAGGATCCCAATAAGCCCACAAGCTGGACTGAGAATCAGGCTGGAAAAATACCAGAGAAAG
TGACTCCAATGGCAGCAATTCAAGATGGTCTTGCTAAGGGAGAAAACGATGAAACAGTATCTAACACA
TTAACCTTGACAAATGGCTTGGAAAGGAGAACTAAAACCTACAGTGAAGACAACTTTGAGGAACTCCA
ATATTTCCCAAATTTCTATGCGCTACTGAAAAGTATTGATTCAGAAAAAGAAGCAAAAGAGAAAGAAA
CACTGATTACTATCATGAAAACACTGATTGACTTTGTGAAGATGATGGTGAAATATGGAACAATATCT
CCAGAAGAAGGTGTTTCCTACCTTGAAAACTTGGATGAAATGATTGCTCTTCAGACCAAAAACAAGCT
AGAAAAAAATGCTACTGACAATATAAGCAAGCTTTTCCCAGCACCATCAGAGAAGAGTCATGAAGAAA
CAGACAGTACCAAGGAAGAAGCAGCTAAGATGGAAAAGGAATATGGAAGCTTGAAGGATTCCACAAAA
GATGATAACTCCAACCCAGGAGGAAAGACAGATGAACCCAAAGGAAAAACAGAAGCCTATTTGGAAGC
CATCAGAAAAAATATTGAATGGTTGAAGAAACATGACAAAAAGGGAAATAAAGAAGATTATGACCTTT
CAAAGATGAGAGACTTCATCAATAAACAAGCTGATGCTTATGTGGAGAAAGGCATCCTTGACAAGGAA
GAAGCCGAGGCCATCAAGCGCATTTATAGCAGCCTGTAAAAATGGCAAAAGATCCAGGAGTCTTTCAA 
CTGTTTCAGAAAACATAATATAGCTTAAAACACTTCTAATTCTGTGATTAAAATTTTTTGACCCAAGG 
GTTATTAGAAAGTGCTGAATTTACAGTAGTTAACCTTTTACAAGTGGTTAAAACATAGCTTTCTTCCC
GTAAAAACTATCTGAAAGTAAAGTTGTATGTAAGCTGAAAAAAAAAAAAAAAAAAAA

FIGURE 150



MGFLGTGTWILVLVLPIQAFPKPGGSQDKSLHNRELSAERPLNEQIAEAEEDKIKKTYPPENKPG
QSNYSFVDNLNLLKAITEKEKIEKERQSIRSSPLDNKLNVEDVDSTKNRKLIDDYDSTKSGLDHK 
FQDDPDGLHQLDGTPLTAEDIVHKIAARIYEENDRAVFDKIVSKLLNLGLITESQAHTLEDEVAE 
VLQKLISKEANNYEEDPNKPTSWTENQAGKIPEKVTPMAAIQDGLAKGENDETVSNTLTLTNGLE 
RRTKTYSEDNFEELQYFPNFYALLKSIDSEKEAKEKETLITIMKTLIDFVKMMVKYGTISPEEGV 
SYLENLDEMIALQTKNKLEKNATDNISKLFPAPSEKSHEETDSTKEEAAKMEKEYGSLKDSTKDD
NSNPGGKTDEPKGKTEAYLEAIRKNIEWLKKHDKKGNKEDYDLSKMRDFINKQADAYVEKGILDK 
EEAEAIKRIYSSL 

N-glycosylation sites: 

amino acids 68-71, 346-349, 350-353 



Casein kinase II phosphorylation site:

amino acids 70-73, 82-85, 97-100, 125-128, 147-150, 188-191,
217-220, 265-268, 289-292, 305-308, 320-323, 326-329, 362-365,
368-341, 369-372, 382-385, 386-389, 387-390

N-myristoylation sites:

amino acids 143-148, 239-244

FIGURE 151



CGGCTCGAGGCTCCCGCCAGGAGAAAGGAACATTCTGAGGGGAGTCTACACCCTGTGGAGCTCAA 
GATGGTCCTGAGTGGGGCGCTGTGCTTCCGAATGAAGGACTCGGCATTGAAGGTGCTTTATCTGC 
ATAATAACCAGCTTCTAGCTGGAGGGCTGCATGCAGGGAAGGTCATTAAAGGTGAAGAGATCAGC
GTGGTCCCCAATCGGTGGCTGGATGCCAGCCTGTCCCCCGTCATCCTGGGTGTCCAGGGTGGAAG 
CCAGTGCCTGTCATGTGGGGTGGGGCAGGAGCCGACTCTAACACTAGAGCCAGTGAACATCATGG 
AGCTCTATCTTGGTGCCAAGGAATCCAAGAGCTTCACCTTCTACCGGCGGGACATGGGGCTCACC 
TCCAGCTTCGAGTCGGCTGCCTACCCGGGCTGGTTCCTGTGCACGGTGCCTGAAGCCGATCAGCC 
TGTCAGACTCACCCAGCTTCCCGAGAATGGTGGCTGGAATGCCCCCATCACAGACTTCTACTTCC 
AGCAGTGTGACTAGGGCAACGTGCCCCCCAGAACTCCCTGGGCAGAGCCAGCTCGGGTGAGGGGT 
GAGTGGAGGAGACCCATGGCGGACAATCACTCTCTCTGCTCTCAGGACCCCCACGTCTGACTTAG 
TGGGCACCTGACCACTTTGTCTTCTGGTTCCCAGTTTGGATAAATTCTGAGATTTGGAGCTCAGT 
CCACGGTCCTCCCCCACTGGATGGTGCTACTGCTGTGGAACCTTGTAAAAACCATGTGGGGTAAA
CTGGGAATAACATGAAAAGATTTCTGTGGGGGTGGGGTGGGGGAGTGGTGGGAATCATTCCTGCT
TAATGGTAACTGACAAGTGTTACCCTGAGCCCCGCAGGCCAACCCATCCCCAGTTGAGCCTTATA
GGGTCAGTAGCTCTGCACATGAAGTCCTGTCACTCACCACTGTGCAGGAGAGGGAGGTGGTCATA
GAGTCAGGGATCTATGGCCCTTGGCCCAGCCCCACCCCCTTCCCTTTAATCCTGCCACTGTCATA
TGCTACCTTTCCTATCTCTTCCCTCATCATCTTGTTGTGGGCATGAGGAGGTGGTGATGTCAGAA
GAAATGGCTCGAGCTCAGAAGATAAAAGATAAGTAGGGTATGCTGATCCTCTTTTAAAAACCCAA
GATACAATCAAAATCCCAGATGCTGGTCTCTATTCCCATGAAAAAGTGCTCATGACATATTGAGA
AGACCTACTTACAAAGTGGCATATATTGCAATTTATTTTAATTAAAAGATACCTATTTATATATT
TCTTTATAGAAAAAAGTCTGGAAGAGTTTACTTCAATTGTAGCAATGTCAGGGTGGTGGCAGTAT
AGGTGATTTTTCTTTTAATTCTGTTAATTTATCTGTATTTCCTAATTTTTCTACAATGAAGATGA
ATTCCTTGTATAAAAATAAGAAAAGAAATTAATCTTGAGGTAAGCAGAGCAGACATCATCTCTGA
TTGTCCTCAGCCTCCACTTCCCCAGAGTAAATTCAAATTGAATCGAGCTCTGCTGCTCTGGTTGG
TTGTAGTAGTGATCAGGAAACAGATCTCAGCAAAGCCACTGAGGAGGAGGCTGTGCTGAGTTTGT
GTGGCTGGAATCTCTGGGTAAGGAACTTAAAGAACAAAAATCATCTGGTAATTCTTTCCTAGAAG
GATCACAGCCCCTGGGATTCCAAGGCATTGGATCCAGTCTCTAAGAAGGCTGCTGTACTGGTTGA

ATTGTGTCCCCCTCAAATTCACATCCTTCTTGGAATCTCAGTCTGTGAGTTTATTTGGAGATAAG 
GTCTCTGCAGATGTAGTTAGTTAAGACAAGGTCATGCTGGATGAAGGTAGACCTAAATTCAATAT 
GACTGGTTTCCTTGTATGAAAAGGAGAGGACACAGAGACAGAGGAGACGCGGGGAAGACTATGTA 
AAGATGAAGGCAGAGATCGGAGTTTTGCAGCCACAAGCTAAGAAACACCAAGGATTGTGGCAACC
ATCAGAAGCTTGGAAGAGGCAAAGAAGAATTCTTCCCTAGAGGCTTTAGAGGGATAACGGCTCTG
CTGAAACCTTAATCTCAGACTTCCAGCCTCCTGAACGAAGAAAGAATAAATTTCGGCTGTTTTAA
GCCACCAAGGATAATTGGTTACAGCAGCTCTAGGAAACTAATACAGCTGCTAAAATGATCCCTGT
CTCCTCGTGTTTACATTCTGTGTGTGTCCCCTCCCACAATGTACCAAAGTTGTCTTTGTGACCAA 
TAGAATATGGCAGAAGTGATGGCATGCCACTTCCAAGATTAGGTTATAAAAGACACTGCAGCTTC 
TACTTGAGCCCTCTCTCTCTGCCACCCACCGCCCCCAATCTATCTTGGCTCACTCGCTCTGGGGG 
AAGCTAGCTGCCATGCTATGAGCAGGCCTATAAAGAGACTTACGTGGTAAAAAATGAAGTCTCCT 
GCCCACAGCCACATTAGTGAACCTAGAAGCAGAGACTCTGTGAGATAATCGATGTTTGTTGTTTT
AAGTTGCTCAGTTTTGGTCTAACTTGTTATGCAGCAATAGATAAATAATATGCAGAGAAAGAG

FIGURE 152

MVLSGALCFRMKDSALKVLYLHNNQLLAGGLHAGKVIKGEEISVVPNRWLDASLSPVILGVQGGS
QCLSCGVGQEPTLTLEPVNIMELYLGAKESKSFTFYRRDMGLTSSFESAAYPGWFLCTVPEADQP 
VRLTQLPENGGWNAPITDFYFQQCD 

N-myristoylation sites. 

amino acids 29-34, 30-35, 60-65, 63-68, 73-78, 91-96, 106-111 

Interleukin-1 signature.

amino acids 111-131

Interleukin-1 proteins.

amino acids 8-29, 83-120, 95-134, 64-103

FIGURE 153



C TT CAGAACAGGT TCTCCT T CCCCAGTCACCAG'JLTGCTCGAGT TAGAAT TG'l'CXGC A ATG GCCGC 
CCTGCAGAAATCTGTGAGCTCTTTCCTTATGGGGACCCTGGCCACCAGCTGCCTCCTTCTCTTGG 
CCCTCTTGGTACAGGGAGGAGCAGCTGCGCCCATCAGCTCCCACTGCAGGCTTGACAAGTCCAAC 
TTCCAGCAGCCCTATATCACCAACCGCACCTTCATGCTGGCTAAGGAGGCTAGCTTGGCTGATAA 
CAACACAGACGTTCGTCTCATTGGGGAGAAACTGTTCCACGGAGTCAGTATGAGTGAGCGCTGCT 
ATCTGATGAAGCAGGTGCTGAACTTCACCCTTGAAGAAGTGCTGTTCCCTCAATCTGATAGGTTC 
CAGCCTTATATGCAGGAGGTGGTGCCCTTCCTGGCCAGGCTCAGCAACAGGCTAAGCACATGTCA 
TATTGAAGGTGATGACCTGCATATCCAGAGGAATGTGCAAAAGCTGAAGGACACAGTGAAAAAGC
TTGGAGAGAGTGGAGAGATCAAAGCAATTGGAGAACTGGATTTGCTGTTTATGTCTCTGAGAAAT
GCCTGCATTTGACCAGAGCAAAGCTGAAAAATGAATAACTAACCCCCTTTCCCTGCTAGAAATAA
CAATTAGATGCCCCAAAGCGATTTTTTTTAACCAAAAGGAAGATGGGAAGCCAAACTCCATCATG
ATGGGTGGATTCCAAATGAACCCCTGCGTTAGTTACAAAGGAAACCAATGCCACTTTTGTTTATA
AGACCAGAAGGTAGACTTTCTAAGCATAGATATTTATTGATAACATTTCATTGTAACTGGTGTTC
TATACACAGAAAACAATTTATTTTTTAAATAATTGTCTTTTTCCATAAAAAAGATTACTTTCCAT
TCCTTTAGGGGAAAAAACCCCTAAATAGCTTCATGTTTCCATAATCAGTACTTTATATTTATAAA
TGTATTTATTATTATTATAAGACTGCATTTTATTTATATCATTTTATTAATATGGATTTATTTAT
AGAAACATCATTCGATATTGCTACTTGAGTGTAAGGCTAATATTGATATTTATGACAATAATTAT
AGAGCTATAACATGTTTATTTGACCTCAATAAACACTTGGATATCCC

FIGURE 154

MAALQKSVSSFLMGTLATSCLLLLALLVQGGAAAPISSHCRLDKSNFQQPYITNRTFMLAKEASL
ADNNTDVRLIGEKLFHGVSMSERCYLMKQVLNFTLEEVLFPQSDRFQPYMQEVVPFLARLSNRLS
TCHIEGDDLHIQRNVQKLKDTVKKLGESGEIKAIGELDLLFMSLRNACI



Important features of the protein: 
Signal peptide: 

amino acids 1-33 

N-glycosylation sites. 

amino acids 54-58, 68-72, 97-101 



N-myristoylation sites.

amino acids 14-20, 82-88



Prokaryotic membrane lipoprotein lipid attachment site. 

amino acids 10-21

FIGURE 155



GGCTTGCTGAAAATAAAATCAGGACTCCTAACCTGCTCCAGTCAGCCTGCTTCCACGAGGCCTGT 
CAGTCAGTGCCCGACTTGTGACTGAGTGTGCAGTGCCCAGCATGTACCAGGTCAGTGCAGAGGGC 
TGCCTGAGGGCTGTGCTGAGAGGGAGAGGAGCAGAGATGCTGCTGAGGGTGGAGGGAGGCCAAGC
TGCCAGGTTTGGGGCTGGGGGCCAAGTGGAGTGAGAAACTGGGATCCCAGGGGGAGGGTGCAGAT 
GAGGGAGCGACCCAGATTAGGTGAGGACAGTTCTCTCATTAGCCTTTTCCTACAGGTGGTTGCAT 
TCTTGGCAATGGTCATGGGAACCCACACCTACAGCCACTGGCCCAGCTGCTGCCCCAGCAAAGGG 
CAGGACACCTCTGAGGAGCTGCTGAGGTGGAGCACTGTGCCTGTGCCTCCCCTAGAGCCTGCTAG
GCCCAACCGCCACCCAGAGTCCTGTAGGGCCAGTGAAGATGGACCCCTCAACAGCAGGGCCATCT
CCCCCTGGAGATATGAGTTGGACAGAGACTTGAACCGGCTCCCCCAGGACCTGTACCACGCCCGT
TGCCTGTGCCCGCACTGCGTCAGCCTACAGACAGGCTCCCACATGGACCCCCGGGGCAACTCGGA
GCTGCTCTACCACAACCAGACTGTCTTCTACAGGCGGCCATGCCATGGCGAGAAGGGCACCCACA
AGGGCTACTGCCTGGAGCGCAGGCTGTACCGTGTTTCCTTAGCTTGTGTGTGTGTGCGGCCCCGT
GTGATGGGCTAGCCGGACCTGCTGGAGGCTGGTCCCTTTTTGGGAAACCTGGAGCCAGGTGTACA
ACCACTTGCCATGAAGGGCCAGGATGCCCAGATGCTTGGCCCCTGTGAAGTGCTGTCTGGAGCAG
CAGGATCCCGGGACAGGATGGGGGGCTTTGGGGAAAACCTGCACTTCTGCACATTTTGAAAAGAG
CAGCTGCTGCTTAGGGCCGCCGGAAGCTGGTGTCCTGTCATTTTCTCTCAGGAAAGGTTTTCAAA
GTTCTGCCCATTTCTGGAGGCCACCACTCCTGTCTCTTCCTCTTTTCCCATCCCCTGCTACCCTG
GCCCAGCACAGGCACTTTCTAGATATTTCCCCCTTGCTGGAGAAGAAAGAGCCCCTGGTTTTATT
TGTTTGTTTACTCATCACTCAGTGAGCATCTACTTTGGGTGCATTCTAGTGTAGTTACTAGTCTT
TTGACATGGATGATTCTGAGGAGGAAGCTGTTATTGAATGTATAGAGATTTATCCAAATAAATAT
CTTTATTTAAAAATGAAAAA

FIGURE 156

MRERPRLGEDSSLISLFLQVVAFLAMVMGTHTYSHWPSCCPSKGQDTSEELLRWSTVPVPPLEPA
RPNRHPESCRASEDGPLNSRAISPWRYELDRDLNRLPQDLYHARCLCPHCVSLQTGSHMDPRGNS 
ELLYHNQTVFYRRPCHGEKGTHKGYCLERRLYRVSLACVCVRPRVMG 

Important features of the protein: 
Signal peptide: 

amino acids 1-32 

N-glycosylation site. 

amino acids 136-140 

Tyrosine kinase phosphorylation site. 

amino acids 127-135 

N-myristoylation sites. 

amino acids 44-50, 150-156

FIGURE 157



CCGGCG ATG TCGCTCGTGCTGCTAAGCCTGGCCGCGCTGTGCAGGAGCGCCGTACCCCGAGAGCC 
GACCGTTCAATGTGGCTCTGAAACTGGGCCATCTCCAGAGTGGATGCTACAACATGATCTAATCC 
CCGGAGACTTGAGGGACCTCCGAGTAGAACCTGTTACAACTAGTGTTGCAACAGGGGACTATTCA
ATTTTGATGAATGTAAGCTGGGTACTCCGGGCAGATGCCAGCATCCGCTTGTTGAAGGCCACCAA
GATTTGTGTGACGGGCAAAAGCAACTTCCAGTCCTACAGCTGTGTGAGGTGCAATTACACAGAGG
CCTTCCAGACTCAGACCAGACCCTCTGGTGGTAAATGGACATTTTCCTACATCGGCTTCCCTGTA 
GAGCTGAACACAGTCTATTTCATTGGGGCCCATAATATTCCTAATGCAAATATGAATGAAGATGG 
CCCTTCCATGTCTGTGAATTTCACCTCACCAGGCTGCCTAGACCACATAATGAAATATAAAAAAA
AGTGTGTCAAGGCCGGAAGCCTGTGGGATCCGAACATCACTGCTTGTAAGAAGAATGAGGAGACA
GTAGAAGTGAACTTCACAACCACTCCCCTGGGAAACAGATACATGGCTCTTATCCAACACAGCAC
TATCATCGGGTTTTCTCAGGTGTTTGAGCCACACCAGAAGAAACAAACGCGAGCTTCAGTGGTGA
TTCCAGTGACTGGGGATAGTGAAGGTGCTACGGTGCAGCTGACTCCATATTTTCCTACTTGTGGC
AGCGACTGCATCCGACATAAAGGAACAGTTGTGCTCTGCCCACAAACAGGCGTCCCTTTCCCTCT
GGATAACAACAAAAGCAAGCCGGGAGGCTGGCTGCCTCTCCTCCTGCTGTCTCTGCTGGTGGCCA
CATGGGTGCTGGTGGCAGGGATCTATCTAATGTGGAGGCACGAAAGGATCAAGAAGACTTCCTTT
TCTACCACCACACTACTGCCCCCCATTAAGGTTCTTGTGGTTTACCCATCTGAAATATGTTTCCA
TCACACAATTTGTTACTTCACTGAATTTCTTCAAAACCATTGCAGAAGTGAGGTCATCCTTGAAA
AGTGGCAGAAAAAGAAAATAGCAGAGATGGGTCCAGTGCAGTGGCTTGCCACTCAAAAGAAGGCA
GCAGACAAAGTCGTCTTCCTTCTTTCCAATGACGTCAACAGTGTGTGCGATGGTACCTGTGGCAA

GAGCGAGGGCAGTCCCAGTGAGAACTCTCAAGACCTCTTCCCCCTTGCCTTTAACCTTTTCTGCA 
GTGATCTAAGAAGCCAGATTCATCTGCACAAATACGTGGTGGTCTACTTTAGAGAGATTGATACA 
AAAGACGATTACAATGCTCTCAGTGTCTGCCCCAAGTACCACCTCATGAAGGATGCCACTGCTTT 
CTGTGCAGAACTTCTCCATGTCAAGCAGCAGGTGTCAGCAGGAAAAAGATCACAAGCCTGCCACG 
ATGGCTGCTGCTCCTTGTAG

FIGURE 158






MSLVLLSLAALCRSAVPREPTVQCGSETGPSPEWMLQHDLIPGDLRDLRVEPVTTSVATGDYSILMNVSWV 
LRADASIRLLKATKICVTGKSNFQSYSCVRCNYTEAFQTQTRPSGGKWTFSYIGFPVELNTVYFIGAHNIP 
NANMNEDGPSMSVNFTSPGCLDHIMKYKKKCVKAGSLWDPNITACKKNEETVEVNFTTTPLGNRYMALIQK 
STIIGFSQVFEPHQKKQTRASVVIPVTGDSEGATVQLTPYFPTCGSDCIRHKGTVVLCPQTGVPFPLDNNK
SKPGGWLPLLLLSLLVATWVLVAGIYLMWRHERIKKTSFSTTTLLPPIKVLVVYPSEICFHHTICYFTEFL
QNHCRSEVILEKWQKKKIAEMGPVQWLATQKKAADKVVFLLSNDVNSVCDGTCGKSEGSPSENSQDLFPLA
FNLFCSDLRSQIHLHKYVVVYFREIDTKDDYNALSVCPKYHLMKDATAFCAELLHVKQQVSAGKRSQACHD
GCCSL 

Important features of the protein: 
Signal peptide: 

amino acids 1-14 



Transmembrane domain:

amino acids 290-309

N-glycosylation sites.

amino acids 67-71, 103-107, 156-160, 183-187, 197-201 and 283-287

cAMP- and cGMP-dependent protein kinase phosphorylation sites.

amino acids 228-232 and 319-323

Casein kinase II phosphorylation sites.

amino acids 178-182, 402-406, 414-418 and 453-457

N-myristoylation site. 

amino acids 116-122 

Amidation site. 

amino acids 488-452 






FIGURE 159



AGCCACCAGCGCAACATGACAGTGAAGACCCTGCATGGCCCAGCCATGGTCAAGTACTTGCTGCT
GTCGATATTGGGGCTTGCCTTTCTGAGTGAGGCGGCAGCTCGGAAAATCCCCAAAGTAGGACATA
CTTTTTTCCAAAAGCCTGAGAGTTGCCCGCCTGTGCCAGGAGGTAGTATGAAGCTTGACATTGGC
ATCATCAATGAAAACCAGCGCGTTTCCATGTCACGTAACATCGAGAGCCGCTCCACCTCCCCCTG
GAATTACACTGTCACTTGGGACCCCAACCGGTACCCCTCGGAAGTTGTACAGGCCCAGTGTAGGA
ACTTGGGCTGCATCAATGCTCAAGGAAAGGAAGACATCTCCATGAATTCCGTTCCCATCCAGCAA
GAGACCCTGGTCGTCCGGAGGAAGCACCAAGGCTGCTCTGTTTCTTTCCAGTTGGAGAAGGTGCT
GGTGACTGTTGGCTGCACCTGCGTCACCCCTGTCATCCACCATGTGCAG TAA GAGGTGCATATCC
ACTCAGCTGAAGAAG

FIGURE 160

MTVKTLHGPAMVKYLLLSILGLAFLSEAAARKIPKVGHTFFQKPESCPPVPGGSMKLDIGIINEN 
QRVSMSRNIESRSTSPWNYTVTWDPNRYPSEVVQAQCRNLGCINAQGKEDISMNSVPIQQETLVV
RRKHQGCSVSFQLEKVLVTVGCTCVTPVIHHVQ 



Signal sequence:

amino acids 1-30 

N-glycosylation site. 

amino acids 83-87 



N-myristoylation sites.

amino acids 106-111, 136-141

FIGURE 161



ACACTGGCCAAACAAAAACGAAAGCACTCCGTGCTGGAAGTAGGAGGAGAGTCAGGACTCCCAGG
ACAGAGAGTGCACAAACTACCCAGCACAGCCCCCTCCGCCCCCTCTGGAGGCTGAAGAGGGATTC 
CAGCCCCTGCCACCCACAGACACGGGCTGACTGGGGTGTCTGCCCCCCTTGGGGGGGGGCAGCAC 
AGGGCCTCAGGCCTGGGTGCCACCTGGCACCTAGAAG ATG CCTGTGCCCTGGTTCTTGCTGTCCT 
TGGCACTGGGCCGAAGCCCAGTGGTCCTTTCTCTGGAGAGGCTTGTGGGGCCTCAGGACGCTACC 
CACTGCTCTCCGGGCCTCTCCTGCCGCCTCTGGGACAGTGACATACTCTGCCTGCCTGGGGACAT 
CGTGCCTGGTCCGGGCCCCGTGCTGGCGCCTACGCACCTGCAGACAGAGCTGGTGCTGAGGTGCC 
AGAAGGAGACCGACTGTGACCTCTGTCTGCGTGTGGCTGTCCACTTGGCCGTGCATGGGCACTGG 
GAAGAGCCTGAAGATGAGGAAAAGTTTGGAGGAGCAGCTGACTCAGGGGTGGAGGAGCCTAGGAA
TGCCTCTCTCCAGGCCCAAGTGGTGCTCTCCTTCCAGGCCTACCCTACTGCCCGCTGCGTCCTGC 
TGGAGGTGCAAGTGCCTGCTGCCCTTGTGCAGTTTGGTCAGTCTGTGGGCTCTGTGGTATATGAC 
TGCTTCGAGGCTGCCCTAGGGAGTGAGGTACGAATCTGGTCCTATACTCAGCCCAGGTACGAGAA
GGAACTCAACCACACACAGCAGCTGCCTGCCCTGCCCTGGCTCAACGTGTCAGCAGATGGTGACA 
ACGTGCATCTGGTTCTGAATGTCTCTGAGGAGCAGCACTTCGGCCTCTCCCTGTACTGGAATCAG
GTCCAGGGCCCCCCAAAACCCCGGTGGCACAAAAACCTGACTGGACCGCAGATCATTACCTTGAA
CCACACAGACCTGGTTCCCTGCCTCTGTATTCAGGTGTGGCCTCTGGAACCTGACTCCGTTAGGA
CGAACATCTGCCCCTTCAGGGAGGACCCCCGCGCACACCAGAACCTCTGGCAAGCCGCCCGACTG
CGACTGCTGACCCTGCAGAGCTGGCTGCTGGACGCACCGTGCTCGCTGCCCGCAGAAGCGGCACT
GTGCTGGCGGGCTCCGGGTGGGGACCCCTGCCAGCCACTGGTCCCACCGCTTTCCTGGGAGAACG
TCACTGTGGACAAGGTTCTCGAGTTCCCATTGCTGAAAGGCCACCCTAACCTCTGTGTTCAGGTG
AACAGCTCGGAGAAGCTGCAGCTGCAGGAGTGCTTGTGGGCTGACTCCCTGGGGCCTCTCAAAGA
CGATGTGCTACTGTTGGAGACACGAGGCCCCCAGGACAACAGATCCCTCTGTGCCTTGGAACCCA
GTGGCTGTACTTCACTACCCAGCAAAGCCTCCACGAGGGCAGCTCGCCTTGGAGAGTACTTACTA
CAAGACCTGCAGTCAGGCCAGTGTCTGCAGCTATGGGACGATGACTTGGGAGCGCTATGGGCCTG
CCCCATGGACAAATACATCCACAAGCGCTGGGCCCTCGTGTGGCTGGCCTGCCTACTCTTTGCCG
CTGCGCTTTCCCTCATCCTCCTTCTCAAAAAGGATCACGCGAAAGGGTGGCTGAGGCTCTTGAAA
CAGGACGTCCGCTCGGGGGCGGCCGCCAGGGGCCGCGCGGCTCTGCTCCTCTACTCAGCCGATGA
CTCGGGTTTCGAGCGCCTGGTGGGCGCCCTGGCGTCGGCCCTGTGCCAGCTGCCGCTGCGCGTGG

CCGTAGACCTGTGGAGCCGTCGTGAACTGAGCGCGCAGGGGCCCGTGGCTTGGTTTCACGCGCAG 
CGGCGCCAGACCCTGCAGGAGGGCGGCGTGGTGGTCTTGCTCTTCTCTCCCGGTGCGGTGGCGCT 
GTGCAGCGAGTGGCTACAGGATGGGGTGTCCGGGCCCGGGGCGCACGGCCCGCACGACGCCTTCC 
GCGCCTCGCTCAGCTGCGTGCTGCCCGACTTCTTGCAGGGCCGGGCGCCCGGCAGCTACGTGGGG 
GCCTGCTTCGACAGGCTGCTCCACCCGGACGCCGTACCCGCCCTTTTCCGCACCGTGCCCGTCTT 
CACACTGCCCTCCCAACTGCCAGACTTCCTGGGGGCCCTGCAGCAGCCTCGCGCCCCGCGTTCCG 
GGCGGCTCCAAGAGAGAGCGGAGCAAGTGTCCCGGGCCCTTCAGCCAGCCCTGGATAGCTACTTC 
CATCCCCCGGGGACTCCCGCGCCGGGACGCGGGGTGGGACCAGGGGCGGGACCTGGGGCGGGGGA 
CGGGACTTAAATAAAGGCAGACGCTGTTTTTCTAAAAAAA

FIGURE 162



MPVPWFLLSLALGRSPVVLSLERLVGPQDATHCSPGLSCRLWDSDILCLPGDIVPAPGPVLAPTHLQTELV
LRCQKETDCDLCLRVAVHLAVHGHWEEPEDEEKFGGAADSGVEEPRNASLQAQVVLSFQAYPTARCVLLEV
QVPAALVQFGQSVGSVVYDCFEAALGSEVRIWSYTQPRYEKELNHTQQLPALPWLNVSADGDNVHLVLNVS
EEQHFGLSLYWNQVQGPPKPRWHKNLTGPQIITLNHTDLVPCLCIQVWPLEPDSVRTNICPFREDPRAHQN
LWQAARLRLLTLQSWLLDAPCSLPAEAALCWRAPGGDPCQPLVPPLSWENVTVDKVLEFPLLKGHPNLCVQ
VNSSEKLQLQECLWADSLGPLKDDVLLLETRGPQDNRSLCALEPSGCTSLPSKASTRAARLGEYLLQDLQS
GQCLQLWDDDLGALWACPMDKYIHKRWALVWLACLLFAAALSLTLLLKKDHAKGWLRLLKQDVRSGAAARG
RAALLLYSADDSGFERLVGALASALCQLPLRVAVDLWSRRELSAQGPVAWFHAQRRQTLQEGGVVVLLFSP
GAVALCSEWLQDGVSGPGAHGPHDAFRASLSCVLPDFLQGRAPGSYVGACFDRLLHPDAVPALFRTVPVFT 
LPSQLPDFLGALQQPRAPRSGRLQERAEQVSRALQPALDSYFHPPGTPAPGRGVGPGAGPGAGDGT 



Signal sequence:

amino acids 1-20



Transmembrane domain. 

amino acids 453-475 

N-glycosylation sites. 

amino acids 118-121, 186-189, 198-201, 211-214, 238-241, 248-251, 
334-337, 357-360, 391-394 

Glycosaminoglycan attachment site. 

amino acids 583-586 



cAMP- and cGMP-dependent protein kinase phosphorylation site. 

amino acids 552-555 



N-myristoylation sites. 

amino acids 107-112, 152-157, 319-324, 438-443, 516-521, 612-617, 
692-697, 696-701, 700-705

FIGURE 163



GGGAGGGCTCTGTGCCAGCCCCG ATG AGGACGCTGCTGACCATCTTGACTGTGGGATCCCTGGCT

GCTCACGCCCCTGAGGACCCCTCGGATCTGCTCCAGCACGTGAAATTCCAGTCCAGCAACTTTGA

AAACATCCTGACGTGGGACAGCGGGCCAGAGGGCACCCCAGACACGGTCTACAGCATCGAGTATA

AGACGTACGGAGAGAGGGACTGGGTGGCAAAGAAGGGCTGTCAGCGGATCACCCGGAAGTCCTGC 

AACCTGACGGTGGAGACGGGCAACCTCACGGAGCTCTACTATGCCAGGGTCACCGCT 

GTCAGTGCGGGAGGCCGGTCAGCCACCAAGATGACTGACAGGTTCAGCTCTCTGCAGCACACTAC 

CCTCAAGCCACCTGATGTGACCTGTATCTCCAAAGTGAGATCGATTCAGATGATTGTTCATCCTA

CCCCCACGCCAATCCGTGCAGGCGATGGCCACCGGCTAACCCTGGAAGACATCTTCCATGACCTG 

TTCTACCACTTAGAGCTCCAGGTCAACCGCACCTACCAAATGCACCTTGGAGGGAAGCAGAGAGA

ATATGAGTTCTTCGGCCTGACCCCTGACACAGAGTTCCTTGGCACCATCATGATTTGCGTTCCCA 

CCTGGGCCAAGGAGAGTGCCCCCTACATGTGCCGAGTGAAGACACTGCCAGACCGGACATGGACC

TACTCCTTCTCCGGAGCCTTCCTGTTCTCCATGGGCTTCCTCGTCGCAGTACTCTGCTACCTGAG 

CTACAGATATGTCACCAAGCCGCCTGCACCTCCCAACTCCCTGAACGTCCAGCGAGTCCTGACTT 

TCCAGCCGCTGCGCTTCATCCAGGAGCACGTCCTGATCCCTGTCTTTGACCTCAGCGGCCCCAGC 

AGTCTGGCCCAGCCTGTCCAGTACTCCCAGATCAGGGTGTCTGGACCCAGGGAGCCCGCAGGAGC 

TCCACAGCGGCATAGCCTGTCCGAGATCACCTACTTAGGGCAGCCAGACATCTCCATCCTCCAGC 

CCTCCAACGTGCCACCTCCCCAGATCCTCTCCCCACTGTCCTATGCCCCAAACGCTGCCCCTGAG 

GTCGGGCCCCCATCCTATGCACCTCAGGTGACCCCCGAAGCTCAATTCCCATTCTACGCCCCACA 

GGCCATCTCTAAGGTCCAGCCTTCCTCCTATGCCCCTCAAGCCACTCCGGACAGCTGGCCTCCCT 

CCTATGGGGTATGCATGGAAGGTTCTGGCAAAGACTCCCCCACTGGGACACTTTCTAGTCCTAAA 

CACCTTAGGCCTAAAGGTCAGCTTCAGAAAGAGCCACCAGCTGGAAGCTGCATGTTAGGTGGCCT 

TTCTCTGCAGGAGGTGACCTCCTTGGCTATGGAGGAATCCCAAGAAGCAAAATCATTGCACCAGC 

CCCTGGGGATTTGCACAGACAGAACATCTGACCCAAATGTGCTACACAGTGGGGAGGAAGGGACA

CCACAGTACCTAAAGGGCCAGCTCCCCCTCCTCTCCTCAGTCCAGATCGAGGGCCACCCCATGTC 

CCTCCCTTTGCAACCTCCTTCCGGTCCATGTTCCCCCTCGGACCAAGGTCCAAGTCCCTGGGGCC 

TGCTGGAGTCCCTTGTGTGTCCCAAGGATGAAGCCAAGAGCCCAGCCCCTGAGACCTCAGACCTG 

GAGCAGCCCACAGAACTGGATTCTCTTTTCAGAGGCCTGGCCCTGACTGTGCAGTGGGAGTCCTG 

AGGGGAATGGGAAAGGCTTGGTGCTTCCTCCCTGTCCCTACCCAGTGTCACATCCTTGGCTGTCA 

ATCCCATGCCTGCCCATGCCACACACTCTGCGATCTGGCCTCAGACGGGTGCCCTTGAGAGAAGC 

AGAGGGAGTGGCATGCAGGGCCCCTGCCATGGGTGCGCTCCTCACCGGAACAAAGCAGCATGATA 

AGGACTGCAGCGGGGGAGCTCTGGGGAGCAGCTTGTGTAGACAAGCGCGTGCTCGCTGAGCCCTG 

CAAGGCAGAAATGACAGTGCAAGGAGGAAATGCAGGGAAACTCCCGAGGTCCAGAGCCCCACCTC

CTAACACCATGGATTCAAAGTGCTCAGGGAATTTGCCTCTCCTTGCCCCATTCCTGGCCAGTTTC 

ACAATCTAGCTCGACAGAGCATGAGGCCCCTGCCTCTTCTGTCATTGTTCAAAGGTGGGAAGAGA 

GCCTGGAAAAGAACCAGGCCTGGAAAAGAACCAGAAGGAGGCTGGGCAGAACCAGAACAACCTGC 

ACTTCTGCCAAGGCCAGGGCCAGCAGGACGGCAGGACTCTAGGGAGGGGTGTGGCCTGCAGCTCA 

TTCCCAGCCAGGGCAACTGCCTGACGTTGCACGATTTCAGCTTCATTCCTCTGATAGAACAAAGC 

GAAATGCAGGTCCACCAGGGAGGGAGACACACAAGCCTTTTCTGCAGGCAGGAGTTTCAGACCCT

ATCCTGAGAATGGGGTTTGAAAGGAAGGTGAGGGCTGTGGCCCCTGGACGGGTACAATAACACAC 

TGTACTGATGTCACAACTTTGCAAGCTCTGCCTTGGGTTCAGCCCATCTGGGCTCAAATTCCAGC 

CTCACCACTCACAAGCTGTGTGACTTCAAACAAATGAAATCAGTGCCCAGAACCTCGGTTTCCTC 

ATCTGTTVATGTGGGGATCATAACACCTACCTCATGGAGTTGTGGTGAAGATGAAATGAAGTCATG 

TCTTTAAAGTGCTTAATAGTGCCTGGTACATGGGCAGTGCCCAATAAACGGTAGCTATTTAAAAA

AAAAAAAA

FIGURE 164



MRTLLTILTVGSLAAHAPEDPSDLLQHVKFQSSNFENILTWDSGPEGTPDTVYSIEYKTYGERDW 
VAKKGCQRITRKSCNLTVETGNLTELYYARVTAVSAGGRSATKMTDRFSSLQHTTLKPPDVTCIS
KVRSIQMIVHPTPTPIRAGDGHRLTLEDIFHDLFYHLELQVNRTYQMHLGGKQREYEFFGLTPDT 
EFLGTIMICVPTWAKESAPYMCRVKTLPDRTWTYSFSGAFLFSMGFLVAVLCYLSYRYVTKPPAP 
PNSLNVQRVLTFQPLRFIQEHVLIPVFDLSGPSSLAQPVQYSQIRVSGPREPAGAPQRHSLSEIT 
YLGQPDISILQPSNVPPPQILSPLSYAPNAAPEVGPPSYAPQVTPEAQFPFYAPQAISKVQPSSY
APQATPDSWPPSYGVCMEGSGKDSPTGTLSSPKHLRPKGQLQKEPPAGSCMLGGLSLQEVTSLAM 
EESQEAKSLHQPLGICTDRTSDPNVLHSGEEGTPQYLKGQLPLLSSVQIEGHPMSLPLQPPSGPC 
SPSDQGPSPWGLLESLVCPKDEAKSPAPETSDLEQPTELDSLFRGLALTVQWES 

Signal sequence. 

amino acids 1-17 

Transmembrane domain. 

amino acids 233-250 

N-glycosylation sites. 

amino acids 80-83, 87-90, 172-175

N-myristoylation sites. 

amino acids 11-16, 47-52, 102-107, 531-536, 565-570

FIGURE 165

XGGCCT AC T GGAAAAAAAAAAAAAAAAAAAAAAAG TCACCC GGGCCCGCGGT GGCCAC AAC ATGG 
CTGCGGCGCCGGGGCTGCTCTTCTGGCTGTTCGTGCTGGGGGCGCTCTGGTGGGTCCCGGGCCAG 
TCGGATCTCAGCCACGGACGGCGTTTCTCGGACCTCAAAGTGTGCGGGGACGAAGAGTGCAGCAT 
GTTAATGTACCGTGGGAAAGCTCTTGAAGACTTCACGGGCCCTGATTGTCGTTTTGTGAATTTTA 
AAAAAGGTGACGATGTATATGTCTACTACAAACTGGCAGGGGGATCCCTTGAACTTTGGGCTGGA
AGTGTTGAACACAGTTTTGGATATTTTCCAAAAGATTTGATCAAGGTACTTCATAAATACACGGA
AGAAGAGCTACATATTCCAGCAGATGAGACAGACTTTGTCTGCTTTGAAGGAGGAAGAGATGATT
TTAATAGTTATAATGTAGAAGAGCTTTTAGGATCTTTGGAACTGGAGGACTCTGTACCTGAAGAG
TCGAAGAAAGCTGAAGAAGTTTCTCAGCACAGAGAGAAATCTCCTGAGGAGTCTCGGGGGCGTGA
ACTTGACCCTGTGCCTGAGCCCGAGGCATTCAGAGCTGATTCAGAGGATGGAGAAGGTGCTTTCT
CAGAGAGCACCGAGGGGCTGCAGGGACAGCCCTCAGCTCAGGAGAGCCACCCTCACACCAGCGGT 
CCTGCGGCTAACGCTCAGGGAGTGCAGTCTTCGTTGGACACTTTTGAAGAAATTCTGCACGATAA 
ATTGAAAGTGCCGGGAAGCGAAAGCAGAACTGGCAATAGTTCTCCTGCCTCGGTGGAGCGGGAGA
AGACAGATGCTTACAAAGTCCTGAAAACAGAAATGAGTCAGAGAGGAAGTGGACAGTGCGTTATT
CATTACAGCAAAGGATTTCGTTGGCATCAAAATCTAAGTTTGTTTTACAAAGATTGTTTTTAGTA
CTAAGCTGCCTTGGC^GTTTGCATTTTTGAGCCAAACAAAAATATATTATTTTCCCTTCTAAGTA
AAAAAAAAAAAAAAAAAAAA

FIGURE 166



MAAAPGLLFWLFVLGALWWVPGQSDLSHGRRFSDLKVCGDEECSMLMYRGKALEDFTGPDCRFVN 
FKKGDDVYVYYKLAGGSLELWAGSVEHSFGYFPKDLIKVLHKYTEEELHIPADETDFVCFEGGRD 
DFNSYNVEELLGSLELEDSVPEESKKAEEVSQHREKSPEESRGRELDPVPEPEAFRADSEDGEGA 
FSESTEGLQGQPSAQESHPHTSGPAANAQGVQSSLDTFEEILHDKLKVPGSESRTGNSSPASVER 
EKTDAYKVLKTEMSQRGSGQCVIHYSKGFRWHQNLSLFYKDCF



Important features of the protein: 



Signal peptide: 

amino acids 1-22



N-glycosylation site. 

amino acids 294-298 

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 30-34 

Tyrosine kinase phosphorylation site. 

amino acids 67-76

N-myristoylation sites. 

amino acids 205-211, 225-231, 277-283 



Amidation site. 

amino acids 28-32

FIGURE 167



CCAGGACCAGGGCGCACCGGCTCAGCCTCTCACTTGTCAGAGGCCGGGGAAGAGAAGCAAAGCGC
AACGGTGTGGTCCAAGCCGGGGCTTCTGCTTCGCCTCTAGGACATACACGGGACCCCCTAACTTC 
AGTCCCCCAAACGCGCACCCTCGAAGTCTTGAACTCCAGCCCCGCACATCCACGCGCGGCAGAGG 
CGCGGCAGGCGGCAGGTCCCGGCCGAAGGCGATGCGCGCAGGGGGTCGGGCAGCTGGGCTCGGGC 
GGCGGGAGTAGGGCCCGGCAGGGAGGCAGGGAGGCTGCATATTCAGAGTCGCGGGCTGCGCCCTC 
GGCAGAGGCCGCCCTCGCTCCACGCAACACCTGCTGCTGCCACCGCGCCGCGATGAGCCGCGTGG 
TCTCGCTGCTGCTGGGCGCCGCGCTGCTCTGCGGCCACGGAGCCTTCTGCCGCCGCGTGGTCAGC 
GGCCAAAAGGTGTGTTTTGCTGACTTCAAGCATCCCTGCTACAAAATGGCCTACTTCCATGAACT 
GTCCAGCCGAGTGAGCTTTCAGGAGGCACGCCTGGCTTGTGAGAGTGAGGGAGGAGTCCTCCTCA 
GCCTTGAGAATGAAGCAGAACAGAAGTTAATAGAGAGCATGTTGCAAAACCTGACAAAACCCGGG
ACAGGGATTTCTGATGGTGATTTCTGGATAGGGCTTTGGAGGAATGGAGATGGGCAAACATCTGG 
TGCCTGCCCAGATCTCTACCAGTGGTCTGATGGAAGCAATTCCCAGTACCGAAACTGGTACACAG 
ATGAACCTTCCTGCGGAAGTGAAAAGTGTGTTGTGATGTATCACCAAGCAACTGCCAATCCTGGC
CTTGGGGGTCCCTACCTTTACCAGTGGAATGATGACAGGTGTAACATGAAGCACAATTATATTTG
CAAGTATGAACCAGAGATTAATCCAACAGCCCCTGTAGAAAAGCCTTATCTTACAAATCAACCAG
GAGACACCCATCAGAATGTGGTTGTTACTGAAGCAGGTATAATTCCCAATCTAATTTATGTTGTT 
ATACCAACAATACCCCTGCTCTTACTGATACTGGTTGCTTTTGGAACCTGTTGTTTCCAGATGCT 
GCATAAAAGTAAAGGAAGAACAAAAACTAGTCCAAACCAGTCTACACTGTGGATTTCAAAGAGTA
CCAGAAAAGAAAGTGGCATGGAAGTATAATAACTCATTGACTTGGTTCCAGAATTTTGTAATTCT
GGATCTGTATAAGGAATGGCATCAGAACAATAGCTTGGAATGGCTTGAAATCACAAAGGATCTGC
AAGATGAACTGTAAGCTCCCCCTTGAGGCAAATATTAAAGTAATTTTTATATGTCTATTATTTCA
TTTAAAGAATATGCTGTGCTAATAATGGAGTGAGACATGCTTATTTTGCTAAAGGATGCACCCAA
ACTTCAAACTTCAAGCAAATGAAATGGACAATGCAGATAAAGTTGTTATCAACACGTCGGGAGTA
TGTGTGTTAGAAGCAATTCCTTTTATTTCTTTCACCTTTCATAAGTTGTTATCTAGTCAATGTAA
TGTATATTGTATTGAAATTTACAGTGTGCAAAAGTATTTTACCTTTGCATAAGTGTTTGATAAAA
ATGAACTGTTCTAATATTTATTTTTATGGCATCTCATTTTTCAATACATGCTCTTTTGATTAAAG
AAACTTATTACTGTTGTCAACTGAATTCACACACACACAAATATAGTACCATAGAAAAAGTTTGT
TTTCTCGAAATAATTCATCTTTCAGCTTCTCTGCTTTTGGTCAATGTCTAGGAAATCTCTTCAGA
AATAAGAAGCTATTTCATTAAGTGTGATATAAACCTCCTCAAACATTTTACTTAGAGGCAAGGAT
TGTCTAATTTCAATTGTGCAAGACATGTGCCTTATAATTATTTTTAGCTTAAAATTAAACAGATT
TTGTAATAATGTAACTTTGTTAATAGGTGCATAAACACTAATGCAGTCAATTTGAACAAAAGAAG
TGACATACACAATATAAATCATATGTCTTCACACGTTGCCTATATAATGAGAAGCAGCTCTCTGA
GGGTTCTGAAATCAATGTGGTCCCTCTCTTGCCCACTAAACAAAGATGGTTGTTCGGGGTTTGGG
ATTGACACTGGAGGCAGATAGTTGCAAAGTTAGTCTAAGGTTTCCCTAGCTGTATTTAGCCTCTG
ACTATATTAGTATACAAAGAGGTCATGTGGTTGAGACCAGGTGAATAGTCACTATCAGTGTGGAG
ACAAGCACAGCACACAGACATTTTAGGAAGGAAAGGAACTACGAAATCGTGTGAAAATGGGTTGG
AACCCATCAGTGATCGCATATTCATTGATGAGGGTTTGCTTGAGATAGAAAATGGTGGCTCCTTT 
CTGTCTTATCTCCTAGTTTCTTCAATGCTTACGCCTTGTTCTTCTCAAGAGAAAGTTGTAACTCT 
CTGGTCTTCATATGTCCCTGTGCTCCTTTTAACCAAATAAAGAGTTCTTGTTTCTGGGGGAAAAA
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA

FIGURE 168



MSRVVSLLLGAALLCGHGAFCRRVVSGQKVCFADFKHPCYKMAYFHELSSRVSFQEARLACESS
GGVLLSLENEAEQKLIESMLQNLTKPGTGISDGDFWIGLWRNGDGQTSGACPDLYQWSDGSNSQ
YRNWYTDEPSCGSEKCVVMYHQPTANPGLGGPYLYQWNDDRCNMKHNYICKYEPEINPTAPVEK
PYLTNQPGDTHQNVVVTEAGIIPNLIYVVIPTIPLLLLILVAFGTCCFQMLHKSKGRTKTSPNQ
STLWISKSTRKESGMEV



Important features of the protein: 
Signal peptide: 

amino acids 1-21 

Transmembrane domain:

amino acids 214-235

N-glycosylation sites.

amino acids 86-89, 255-258

cAMP- and cGMP-dependent protein kinase phosphorylation site.

amino acids 266-269



N-myristoylation sites. 

amino acids 27-32, 66-71, 91-96, 93-98, 102-107, 109-114,
140-145, 212-217